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Figure 1: Nucleotide and deduced amino acid sequence. of CPN100634 

gataaaaatt cttgacagct gttttgtcat ctttaacttg atttacttat tttgtttcta 60 

tattgatgcg aatagttctc taaaaaacaa aagcattacc atg aag act teg att 115 

Met Lys Thr Ser lie 

1.5 

cct tgg gtt tta gtt tec tec gtg tta get ttc tea tgt cac eta cag 163 
Pro Trp Val Leu Val Ser Ser Val Leu Ala Phe Ser Cys His Leu Gin 
10 15 20 

tea eta get aac gag gaa ctt tta tea cct gat gat age ttt aat gga 211 
Ser Leu Ala Asn Glu Glu Leu Leu Ser Pro Asp Asp Ser Phe Asn Gly 
25 30 35 

aat ate gat tea grga acg ttt act cca aaa act tea gee aca aca tat 259 
Asn lie Asp Ser Gly Thr Phe Thr Pro Lys Thr Ser Ala Thr Thr Tyr 
40 45 50 

tct eta aca gga gat gtc ttc ttt tac gag cct gga aaa ggc act ccc 307 
Ser Leu Thr Gly Asp Val Phe Phe Tyr Glu Pro Gly Lys Gly Thr Pro 
55 60 65 

tta tct gac agt tgt ttt aag caa ace acg gac aat ctt acc ttc ttg 355 
Leu Ser Asp Ser Cys Phe Lys Gin Thr Thr Asp Asn Leu Thr Phe Leu 
70 75 80 85 

ggg aac ggt cat age tta acg ttt ggc ttt ata gat get ggc act cat 403 
Gly Asn Gly His Ser Leu Thr Phe Gly Phe lie Asp Ala Gly Thr His 
90 95 100 

gca ggt get get gca tct aca aca gca aat aag aat ctt acc ttc tea 451 
Ala Gly Ala Ala Ala Ser Thr Thr Ala Asn Lys Asn Leu Thr Phe Ser 
105 110 115 

ggg ttt tec tta ctg agt ttt gat tec tct cct age aca acg gtt act 499 
Gly Phe Ser Leu Leu Ser Phe Asp Ser Ser Pro Ser Thr Thr Val Thr 
120 125 130 

aca ggt cag gga acg ctt tec tea gca gga ggc gta aat tta gaa aat 547 
Thr Gly Gin Gly Thr Leu Ser Ser Ala Gly Gly Val Asn Leu Glu Asn 
135 140 145 

att cgt aaa ctt gta gtt get ggg aat ttt tct act gca gat ggt gga 595 
lie Arg Lys Leu Val Val Ala Gly Asn Phe Ser Thr Ala Asp Gly Gly 
150 155 160 165 

get ate aaa gga gcg tct ttc ctt tta act ggc act tct gga gat get 643 
Ala lie Lys Gly Ala Ser Phe Leu Leu Thr Gly Thr Ser Gly Asp Ala 
170 175 180 

ctt ttt agt aac aac tct tea tea aca aag gga gga gca att get act 691 
Leu Phe Ser Asn Asn Ser Ser Ser Thr Lys Gly Gly Ala He Ala Thr 
185 190 195 
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aca gca ggc get cgc ata gca aat aac aca ggt tat gtt aga ttc eta 
Thr Ala Gly Ala Arg lie Ala Asn Asn Thr Gly Tyr Val Arg Phe Leu 
200 205 210 



739 



tct aac ata gcg tct acg tea gga ggc get ate gat gat gaa ggc acg 
Ser Asn lie Ala Ser Thr Ser Gly Gly Ala lie Asp Asp Glu Gly Thr 
215 220 225 



787 



teg ata eta teg aac aac aaa ttt eta tat ttt gaa ggg aat gca gcg 
Ser lie Leu Ser Asn Asn Lys Phe Leu Tyr Phe Glu Gly Asn Ala Ala 
230 235 240 245 



835 



aaa act act ggc ggt gcg ate tgc aac acc aag gcg agt gga tct cct 
Lys Thr Thr Gly Gly Ala He Cys Asn Thr Lys Ala Ser Gly Ser Pro 
250 255 260 



883 



gaa ctg ata ate tct aac aat aag act ctg ate ttt get tea aac gta 
Glu Leu He He Ser Asn Asn Lys Thr Leu He Phe Ala Ser Asn Val 
265 270 275 



931 



gca gaa aca age ggt ggc gee ate cat get aaa aag eta gee ctt tec 
Ala Glu Thr Ser Gly Gly Ala He His Ala Lys Lys Leu Ala Leu Ser 
280 285 290 



979 



tct gga ggc ttt aca gag ttt eta cga aat aat gtc tea tea gca act 
Ser Gly Gly Phe Thr Glu Phe Leu Arg Asn Asn Val Ser Ser Ala Thr 
295 300 305 



1027 



cct aag ggg ggt get ate age ate gat gec tea gga gag etc agt ctt 
Pro Lys Gly Gly Ala He Ser He Asp Ala Ser Gly Glu Leu Ser Leu 
310 315 320 325 



1075 



tct gca gag aca gga aac att acc ttt gta aga aat acc ctt aca aca 
Ser Ala Glu Thr Gly Asn He Thr Phe Val Arg Asn Thr Leu Thr Thr 
330 335 340 



1123 



acc gga agt acc gat act cct aaa cgt aat gcg ate aac ata gga agt 
Thr Gly Ser Thr Asp Thr Pro Lys Arg Asn Ala He Asn lie Gly Ser 
345 350 355 



1171 



aac ggg aaa ttc acg gaa tta egg get get aaa aat cat aca att ttc 
Asn Gly Lys Phe Thr Glu Leu Arg Ala Ala Lys Asn His Thr He Phe 
360 365 370 



1219 



ttc tat gat ccc ate act tea gaa gga acc tea tea gac gta ttg aag 
Phe Tyr Asp Pro He Thr Ser Glu Gly Thr Ser Ser Asp Val Leu Lys 
375 380 385 



1267 



ata aat aac ggc tct gcg gga get etc aat cca tat caa gga acg att 
He Asn Asn Gly Ser Ala Gly Ala Leu Asn Pro Tyr Gin Gly Thr lie 
390 " 395 400 " 405 



1315 



eta ttt tct gga gaa acc eta aca gca gat gaa ctt aaa gtt ore gac 
Leu Phe Ser Gly Glu Thr Leu Thr Ala Asp Glu Leu Lys Val Ala Asp 
410 415 420 



1363 
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aat tta aaa tct tea ttc t^cg cag cca gtc tec eta tec gga gga aag 1411 

Asn Leu Lys Ser Ser Phe Thr Gin Pro Val Ser Leu Ser Gly Gly Lys 

425 430 435 

tta ttg eta caa aag gga gtc act tta gag age acg age ttc tct caa 1459 
Leu Leu Leu Gin Lys Gly Val Thr Leu Glu Ser Thr Ser Phe Ser Gin 
440 445 450 

gag gec ggt tct etc etc ggc atg gat tea gga acg aca tta tea act 1507 
Glu Ala Gly Ser Leu Leu Gly Met Asp Ser Gly Thr Thr Leu Ser Thr 
455 460 465 

aca get ggg agt att aca ate acg aac eta gga ate aat gtt gac tec 1555 
Thr Ala Gly Ser lie Thr lie Thr Asn Leu Gly lie Asn Val Asp Ser 
470 475 480 485 

tta ggt ctt aag cag ccc gtc age eta aca gca aaa ggt get tea aat 1603 
Leu Gly Leu Lys Gin Pro Val Ser Leu Thr Ala Lys Gly Ala Ser Asn 
490 495 500 

aaa gtg ate gta tct ggg aag etc aac ctg att gat att gaa ggg aac 1651 
Lys Val He Val Ser Gly Lys Leu Asn Leu He Asp He Glu Gly Asn 
505 510 515 

att tat gaa agt cat atg ttc age cat gac cag etc ttc tct eta tta 1699 
He Tyr Glu Ser His Met Phe Ser His Asp Gin Leu Phe Ser Leu Leu 
520 525 530 

aaa ate acg gtt gat get gat gtt gat act aac gtt gac ate age age 1747 
Lys He Thf Val Asp Ala Asp Val Asp Thr Asn Val Asp lie Ser Ser 
535 540 545 

ctt ate cct gtt cct get gag gat cct aat tea gaa tac gga ttc caa 1795 
Leu He Pro Val Pro Ala Glu Asp Pro Asn Ser Glu Tyr Gly Phe Gin 
550 555 560 565 

gga caa tgg aat gtt aat tgg act acg gat aca get aca aat aca aaa 1843 
Gly Gin Trp Asn Val Asn Trp Thr Thr Asp Thr Ala Thr Asn Thr Lys 
570 575 580 

gag gee acg gca act tgg acc aaa aca gga ttt gtt ccc age ccc gaa 18 91 
Glu Ala Thr Ala Thr Trp Thr Lys -Thr Gly Phe Val Pro Ser Pro Glu 
585 590 595 



aga aaa tct gcg tta gta tgc aat acc eta tgg gga gtc ttt act gac 
Arg Lys Ser Ala Leu Val Cys Asn Thr Leu Trp Gly Val Phe Thr Asp 
600 605 610 



1939 



att cgc tct ctg caa cag ctt gta gag ate ggc gca act ggt atg gaa 1987 

He Arg Ser Leu Gin Gin Leu Val Glu He Gly Ala Thr Gly Met Glu 

615 620 " 625 

cac aaa caa ggt ttc tgg gtt tec tec atg acg aac ttc ctg cat aaa ?035 

His Lys Gin Gly Phe Trp Val Ser Ser Met Thr Asn Phe Leu His Lvs 

630 635 640 645 
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act gga gat gaa aat cgc aaa ggc ttc cgt cat acc tct gga ggc tac 
Thr Gly Asp Glu Asn Arg Lys Gly Phe Arg His Thr Ser Gly Gly Tyr 
650 655 660 



2083 



gtc ate ggt gga agt get cac act cct aaa gac gac eta ttt acc ttt 2131 

Val lie Gly Gly Ser Ala His Thr Pro Lys Asp Asp Leu Phe Thr Phe 
66S 670 675 

gcg ttc tgc cat etc ttt get aga gac aaa gat tgt ttt ate get cac 2179 

Ala Phe Cys His Leu Phe Ala Arg Asp Lys Asp Cys Phe lie Ala His 

680 685 690 



aac aac tct aga acc tac ggt gga act tta ttc ttc aag cac tct cat 
Asn Asn Ser Arg Thr Tyr Gly Gly Thr Leu Phe Phe Lys His Ser His 
695 700 705 



2227 



acc eta caa ccc caa aac tat ttg aga tta gga aga gca aag ttt tct 2275 
Thr Leu Gin Pro Gin Asn Tyr Leu Arg Leu Gly Arg Ala Lys Phe Ser 
710 715 720 725 

gaa tea get ata gaa aaa ttc cct agg gaa att ccc eta gee ttg gat 2323 
Glu Ser Ala He Glu Lys Phe Pro Arg Glu He Pro Leu Ala Leu Asp 
730 735 740 



gtc caa gtt teg ttc age cat tea gac aac cgt atg gaa acg cac tat 
Val Gin Val Ser Phe Ser His Ser Asp Asn Arg Met Glu Thr His Tyr 
745 750 755 



2371 



acc tea ttg cca gaa tec gaa ggt tct tgg age aac gag tgt ata get 
Thr Ser Leu Pro Glu Ser Glu Gly Ser Trp Ser Asn Glu Cys He Ala 
760 765 770 



2419 



ggt ggt ate ggc eta gac ctt cct ttt gtt ctt tec aac cca cat cct 
Gly Gly He Gly Leu Asp Leu Pro Phe Val Leu Ser Asn Pro His Pro 
775 780 785 



2467 



ctt ttc aag acc ttc att cca cag atg aaa gtc gaa atg gtt tat gta 2515 
Leu Phe Lys Thr Phe He Pro Gin Met Lys Val Glu Met Val Tyr Val 
790 795 800 805 

tea caa aat age ttc ttc gaa age tct agt gat ggc cgt ggt ttt agt 2563 
Ser Gin Asn Ser Phe Phe Glu Ser Ser Ser Asp Gly Arg Gly Phe Ser 
810 815 820 

att gga agg ctg ctt aac etc zcg att cct gtg ggt gcg aaa ttc gtg 2611 
He Gly Arg Leu Leu Asn Leu Ser He Pro Val Gly Ala Lys Phe Val 
825 830 835 



cag ggg gat ate gga gat tec :ac acc tat gat etc tea gga ttc ttt 
Gin Gly Asp He Gly Asp Ser 7yr Thr Tyr Asp Leu Ser Gly Phe Phe 
840 345 850 



2659 



gtt tec gat gtc tat cgt aac aat ccc caa tct aca gcg act r^+rt grg 
Val Ser Asp Val Tyr Arg Asn Asn Pro Gin Ser Thr Ala Thr Leu Val 
855 660 865 



?70" 
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atg age cca gac tct tgg aaa att cgc ggt ggc aat ctt tea aga cag 
Met Ser Pro Asp Ser Trp Lys lie Arg Gly Gly Asn Leu Ser Arg Gin 
870 " 875 880 885 



2755 



gca ttt tta ctg agg ggt age aac aac tac gtc tac aac tec aat tgt 
Ala Phe Leu Leu Arg Gly Ser Asn Asn Tyr Val Tyr Asn Ser Asn Cys 
890 895 900 



2803 



gag, etc ttc gga cat tac get atg gaa etc cgt gga tct tea agg aac 
Glu Leu Phe Gly His Tyr Ala Met Glu Leu Arg Gly Ser Ser Arg Asn 
905 910 915 



2851 



tac aat gta gat gtt ggt ace aaa etc cga ttc tagattgeta aaactcccta 2904 
Tyr Asn Val Asp Val Gly Thr Lys Leu Arg Phe 
920 925 



gttcttctag ggagttttct catactttta gggaaatatt tgctat 



2950 



SUBSTITUTE SHEET (RULE 2B) 



WO 00/32794 



Title: 



CHLAMYDIA ANTIGENS ANI^f 
CORRESPONDING DNA 
FRAGMENTS AND USES THEREOF 
Inventor(s): Andrew D. MURDIN et al. ) 
DOCKET NO . : 03293 1 -0252 j 

; 



09/857128 

PCT/CA99/01I47 



Figure 2: Restriction enzyme analysis of CPN1006J4 

Alul 

Hpyl78III CviJI 
Apol (MspAlI 
Tsp509I | PvuII Msel SfaNI 

III I i 

GATAAAAATTCTTGACAGCTGTTTTGTCATCTTTAACTTGATTTACTTATTTTGTTTCTA 

1 - + + + " + + - + 

CTATTTTTAAGAACTGTCGACAAAACAGTAGAAATTGAACTAAATGAATAAAACAAAGAT 



60 



Nlalll 
MslI | 
I I 



BsaJT 
Styl 
MboII 
Hinf I 
Tfil 
Bbsl | 
Taql | 
I I 



TATTGATGCGAATAGTTCTCTAAAAAACAAAAG CATTAC CATGAAGACTTCG ATTCCTTG 

61 + + - + + + + 120 

ATAACTACGCTTATCAAGAGATTTTTTGTTTTCGTAATGGTACTTCTGAAGCTAAGGAAC 

HphI Maelll Alul 

Alul | Nlalll Taal CviJi 

CviJI |MaeIIl| Tsp45I Mnll | 
Mnllj |Tsp45l| Sfcl | Bfal || HphI 

Ml II I I I II I 
GGTTTTAGTTTCCTCCGTGTTAGCTTTCTCATGTCACCTACAGTCACTAGCTAACGAGGA 
121 + + + + + + 180 

CCAAAATCAAAGGAGGCACAATCGAAAGAGTACAGTGGATGTCAGTGATCGATTGCTCCT 



Msel 
Alul | 
CviJI 



I I 



BsaXI 
Alol 
Ppil 
Eco57I 

Acll 
Maell 
Hpyl78III 
Hinf I 
Tfil 
Clal | 
Taql | 



ACTT1TATCACCTGATGATAGCTTTAATGGAAATATCGATTCAGGAACGTTTACTCCAAA 
181 + + + + + + 240 

TGAAAATAGTGGACTACTATCGAAATTACCTTTATAGCTAAGTCCTTGCAAATGAGGTTT 
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BslI 

' EcoNI 
ScrFI | 

Bbsl CviJI [ | 

CviJI MboII EcoRII | | 

I I lit 

AACT TCAGCCACAACATAT TCTCT AACAGG AG ATGTC T T CT TT T ACG AGCCT G GAAAAGG 

241 + + + + + + 300 

TTGAAGTCGGTGTTGTATAAGAGATTGTCCTCTACAGAAGAAAATGCTCGGACCTTTTCC 



CjePI BsaJI CjePI 
Hpyl88IX Taal Mseli BstDSI BslI | 

II II I II 

CACTCCCTTATCTGACAGTTGTTTTAAGCAAACCACGGACAATCTTACCTTCTTGGGGAA 

301 + + + + + + 360 

GTGAGGGAATAGACTGTCAACAAAATTCGTTTGGTGCCTGTTAGAATGGAAGAACCCCTT 



CviRI 
Fnu4HI 
Tsel 
Fnu4HI 
AlwNI 
BstAPI 
Mwol 
Tsel 
Aarl 
MslI 
CviRI 



Acll 
Maell 
Msel j 
Aiul | i CviJI 
Taal CviJI ] (SfaNI | 
I II I It 



Nlalll 
Bbvl 
Bbvl | 
BspMI | i 
Cac8I| | | 
II I I 



CGGTCATAGCTTAACGTTTGGCTTTATAGATGCTGGCACTCATGCAGGTGCTGCTGCATC 

361 + + + + + + 420 

GCCAGTATCGAATTGCAAACCGAAATATCTACGACCGTGAGTACGTCCACGACGACGTAG 

BseMII 

Hinfl BslI | Ddel Hinfl 

SfaNI Tfil Ddel | | BseMII I Tf il 

I I I I I I I I 

TACAACAGCAAATAAGAATCTTACCTTCTCAGGGTTTTCCTTACTGAGTTTTGATTCCTC 

421 + + + + + + 4 80 

ATGTTGTCGT T TATTCTT AGAATGGAAGAGT CCCAAAAGGAATGAC TCAAAACTAAGGAG 



Maelll 
Mnll | 
Bfal | Taal Sfcl 
II II 



Mnll 
BbvCI I 
BpulOIl 
XmnI Ddel I 
I 1 I 



Apol 
Tsp509I 
BseMII I 
Mnll I I 
I I 1 



TCCTAGCACAACGGTT ACT AC AGG TCAGG GAACGCT T T CC TCAG C AGG AGG CGT AAATTT 

481 + + + + + + 540 

AGGATCGTGTTGCCAATGATGTCCAGTCCCTTGCGAAAGGAGTCGTCCTCCGCATTTAAA 
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Bed 

Apol PstI | Hgal 

Sspl Cjel CviRI | j Alulj 

Xmnl| Tsp509I Sfcl | j j Cvi JI | 

II I I I I I II 
AGAAAATATTCGTAAACTTGTAGTTGCTGGGAATTTTTCTACTGCAGATGGTGGAGCTAT 
541 + + + + + + 600 

TCTTTTATAAGCATTTGAACATCAACGACCCTTAAAAAGATGACGTCTACCACCTCGATA 

Hpyl78III Bpml 
BsrI | MboII | 

Cjel Msel SfaNI j Maelll I 

I I .1 1 II I 
CAAAGGAGCGTCTTTCCTTTTAACTGGCACTTCTGGAGATGCTCTTTTTAGTAACAACTC 
601 + + + + + + 660 

GTTTCCTCGCAGAAAGGAAAATTGACCGTGAAGACCTCTACGAGAAAAATCATTGTTGAG 

Cac8I 

Mwol Haell | 

Mnll Muni BseRI | Hhal | j 

Earl | Tsp509I Sfcl |Cac8I | j |MwoI 

II I I I I III I 

TTCATCAACAAAGGGAGGAGCAATTGCTACTACAGCAGGCGCTCGCATAGCAAATAACAC 

661 + --- + + + + + 720 

AAGTAGTTGTTTCCCTCCTCGTTAACGATGATGTCGTCCGCGAGCGTATCGTTTATTGTG 



HgaX 
BsaBI | 
Hinfl | | 
Tfil | j 

I I I 



Clal 

Hpyl7 8III Taql 
Maell | Haell | 
Mnll | Hhal | j 
AccI | j Hin4l| | | 



AGGTTATGTTAGATTCCTATCTAACATAGCGTCTACGTCAGGAGGCGCTATCGATGATGA 

721 + + + + + + 780 

TCCAATACAATCTAAGGATAGATTGTATCGCAGATGCAGTCCTCCGCGATAGCTACTACT 



BsmI 
Fnu4HI | 
CviRI | 

Taql Apol Tsel | 

Maell | Taql Tsp509I CjePI I | 

i i i i i ii 

AGGCACGTCGATACTATCGAACAACAAATTTCTATATTTTGAAGGGAATGCAGCGAAAAC 
781 + + + + + + 

TCCGTGCAGCTATGATAGCTTGTTGTTTAAAGATATAAAACTTCCCTTACGTCGCTTTTG 
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BsaJI 
Styl 
Bsbl 
CjePI 
CviRI 



Cjel 
Dpnl | 
Sau3AI | j 
BsrI | j j 
Acil | j|I 
Bbvl |J jjj 



841 



Alwl 
Hpyl78IIl| 
Dpnl 
BstYI | 
Hin4I j 
Sau3AI j | | Cjel BplI 

I I 

TACTGGCGGTGCGATCTGCAACACCAAGGCGAGTGGATCTCCTGAACTGATAATCTCTAA 
I. + + + + + 

ATGACCGCCACGCTAGACGTTGTGGTTCCGCTCACCTAGAGGACTTGACTATTAGAGATT 



900 



Bed 
Haell 
Hhal 

Dpnl NlaXV 
Sau3AI | BsaHI | 

Hpyl88IX| j Cjel Acil Narl | | | | Nlalll 

Plel Hinfl || j Maell | Fokl I BanI I Mwol 

I I II I II I I III 
C^TAAGACTCTGATCTTTGCTTCAAACGTAGCAGAAACAAGCGGTGGCGCCATCCATGC 
901 + + + + + + 

GTTATTCTGAGACTAGAAACGAAGTTTGCATCGTCTTTGTTCGCCACCGCGGTAGGTACG 



960 



CviJI 
Cac8I 
Bfalj 
Alul | | 

CviJI | | | Hpyl78III Mnll Pf 111081 

CjelNheljj j Mnll | CviJI | Bpml I BsmAI 

I III 

TAAAAAGCTAGCCCTTTCCTCTGGAGGCTTTACAGAGTTTCTACGAAATAATGTCTCATC 

961 + + + + + + 1020 

ATTTTTCGATCGGGAAAGGAGACCTCCGAAATGTCTCAAAGATGCTTTATTACAGAGTAG 



BSU36I 
Ddel 



BslI 



SfaNI 
I 



Clal 
TaqI 



BseMII 
BanI I 
BsiHKAI 
Bspl286l 
Sad 

Hpyl78III Ddel 
Bsu3 6I | Alulj 
Ddel j CviJI | 
SfaNI | | Mnll j 



CviRI 
Sfcl | 
BsmAI | | 

II I 



AGCAACTCCTAAGGGGGGTGCTATCAGCATCGATGCCTCAGGAGAGCTCAGTCTTTCTGC 
1021 + + + + + + 1080 

TCGTTGAGGATTCCCCCCACGATAGTCGTAGCTACGGAGTCCTCTCGAGTCAGAAAGACG 
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BseMII Mspl BsaXI 

PstI BsaWII Rsal I 

I Mil 
AGAGACAGGAAACATTACCTTTGTAAGAAATACCCTTACAACAACCGGAAGTACCGATAC 

1081 + + + + + + 1140 

TCTCTGTCCTTTGTAATGGAAACATTCTTTATGGGAATGTTGTTGGCCTTCATGGCTATG 



1141 



Tsp509I 
Sthl32I | Fnu4HI 
Apol I | CviJII 
Dpnl Maelll Tsp509I | | Tsel I 
Maell Sau3AI I Sthl32I I BscGI | Bbvl | | BscGI II 
1 It I I II ill III 
TCCTAAACGTAATGCGATCAACATAGGAAGTAACGGGAAATTCACGGAATTACGGGCTGC 
+ + + + + + 12 oo 

AGGATTTGCATTACGCTAGTTGTATCCTTCATTGCCCTTTAAGTGCCTTAATGCCCGACG 



Dpnl 
Sau3AI | 

Tsp509I Eco57I | | Hpyl88IX 
MboII I AlwII I I Bed | 
II Mill I 



Mnll 
Maelll 
Hpyl88IX I | 
NlalV | | | 

I 111 



TAAAAATCATACAATTTTCTTCTATGATCCCATCACTTCAGAAGGAACCTCATCAGACGT 

1201 + + + + + + 1260 

ATTTTTAGTATGTTAAAAGAAGATACTAGGGTAGTGAAGTCTTCCTTGGAGTAGTCTGCA 

Banll 
Bcefl 
BsiHKAI 
CviJI Bspl286I 
Faul | SacI 
MboII I Alul | Hinfl 

Sthl32I| I Acil CviJI | BslI Tfil 

I I I I II I I 

ATTGAAGATAAATAACGGCTCTGCGGGAGCTCTCAATCCATATCAAGGAACGATTCTATT 

1261 + + + + + + 1320 

TAACTTCTATTTATTGCCGAGACGCCCTCGAGAGTTAGGTATAGTTCCTTGCTAAGATAA 



Dral 
Msell 
MboII | | 

Hpyl78III Bpml Msel Tsp509I I I | 

I II I I II 

TTCTGGAGAAACCCTAACAGCAGATGAACTTAAAGTTGCTGACAATTTAAAATCTTCATT 
1321 + + + + -+ + 1380 

AAGACCTCTTTGGGATTGTCGTCTACTTGAATTTCAACGACTGTTAAATTTTAGAAGTAA 
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BsrI 
Cvi JI | 
Fnu4HI | | 
Tsell II 
II II 



Hpyl78III 
Mspl 
BsaWI 
BspEI 
Mnll I 
Bbvl I | 
BsmAI I I 
I I I 



Maelll 

Tsp45I Bce83l 
Hinfll Plel | 
II II 

CACGCAGCCAGTCTCCCTATCCGGAGGAAAGTTATTGCTACAAAAGGGAGTCACTTTAGA 

1381 + + + + + + 1440 

GTGCGTCGGTCAGAGGGATAGGCCTCCTTTCAATAACGATGTTTTCCCTCAGTGAAATCT 



Mspl 
BseRI 
BsrFI 
CviJI 
Haelll 
Hpyl78III 

Smll 
Mnll 



Hin4I 
Alul 
CviJI 
BsiHKAI I 
Bspl28 6I | 
BssSI I | 
II I 



1441 



Hpyl78III 
Hinfl 
Mnll 
Tfil 

BsaJI Nlallll 

BplI | BslI || I Cjel 

II III I I 

GAGCACGAGCTTCTCTCAAGAGGCCGGTTCTCTCCTCGGCATGGATTCAGGAACGACATT 
+ + + + + + 150Q 

CTCGTGCTCGAAGAGAGTTCTCCGGCCAAGAGAGGAGCCGTACCTAAGTCCTTGCTGTAA 



Alul 
CviJI 
MspAlI 
PvuII 
Sfcl | 
I I 



Hinfl 
Bfal | 
Avrlll | 
BsaJI I | 
Cjel | | 1 
Hpyl78III StyllTfil 
f III 



Bsu36I 
Hinfl | 
Hindi | | 
Plel I I Ddel 
I II I 



ATCAACT ACAGCT GGGAGTAT TACAATCACGAACCT AGGAATCAAT GTT GAC T CCTTAGG 

1501 + + + + + + 15 60 

TAGTTGATGTCGACCCTCATAATGTTAGTGCTTGGATCCTTAGTTACAACTGAGGAATCC 



Bbvl 
CviJI 
Sthl32I 
BscGI 
CviJI 



Fnu4HI 
Msel | 
Afllll | 
Smll | Tsell 
II II 



BstAPI Dpnl 
Mwol CjePI Sau3AI I 
I 111 
TCTTAAGCAGCCCGTCAGCCTAACAGCAAAAGGTGCTTCAAATAAAGTGATCGTATCTGG 
1561 + + + + + + 

AGAATTCGTCGGGCAGTCGGATTGTCGTTTTCCACGAAGTTTATTTCACTAGCATAGACC 



1620 
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Nlalll 
MboIl| 

Alul cvi JI | j 

CviJI CjePI Ndel CjePl| || 

>l I II II 

GAAGCTCAACCTGATTGATATTGAAGGGAACATTTATGAAAGTCATATGTTCAGCCATGA 

1621 + + + + + + 168 0 

CTTCGAGTTGGACTAACTATAACTTCCCTTGTAAATACTTTCAGTATACAAGTCGGTACT 

Msel 

Acelll | Taal Hindi 

Alul Earl | j CjePI | Acll | 

CviJI Sapl | jsfaNI . | j Bael Maell 

I III I I I I || 
CCAGCTCTTCTCTCTATTAAAAATCACGGTTGATGCTGATGTTGATACTAACGTTGACAT 
1681 + + + + + + 1740 

GGTCGAGAAGAGAGATAATTTTTAGTGCCAACTACGACTACAACTATGATTGCAACTGTA 



Hpyl88IX 
Alwl I 



Tsp509I 
Dpnl 
NlalV 
BamHI 
BstYI 
Sau3AI 
BbvCI 

Bael BpulOI 
CviJI | Mnll | 

Fnu4HI | j Bbvl | Ddel 
Tsel| | |BseMII |Alwl| 



BsaJI 
Styl 
Hinfl | 
Tfil 



CAGCAGCCTTATCCCTGTTCCTGCTGAGGATCCTAATTCAGAATACGGATTCCAAGGACA 
1741 + + + + + + isoo 

GTCGTCGGAATAGGGACAAGGACGACTCCTAGGATTAAGTCTTATGCCTAAGGTTCCTGT 



BsaJI 
BstDSI 
CviJI | 

Tsp509I Alul Haelj 

Msel | BciVI CviJI Mnll Haelll 

ii i i i ii i 

ATGGAATGTTAATTGGACTACGGATACAGCTACAAATACAAAAGAGGCCACGGCAACTTG 

1801 + + + + + + i860 

TACCTTACAATTAACCTGATGCCTATGTCGATGTTTATGTTTTCTCCGGTGCCGTTGAAC 



Avail 
Sau96I 



Beef I 



CviJI Sthl32I 



CviRI 

I II I 
GACCAAAACAGGATTTGTTCCCAGCCCCGAAAGAAAATCTGCGTTAGTATGCAATACCCT 
1861 + + + + + + 1920 

CTGGTTTTGTCCTAAACAAGGGTCGGGGCTTTCTTTTAGACGCAATCATACGTTATGGGA 
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Alul Dpnl 
Hinfl Plel Hin4I CviRI CviJI, Sau3AI I Hhal 

I I I II Mi 

ATGGGGAGTCTTTACTGACATTCGCTCTCTGCAACAGCTTGTAGAGATCGGCGCAACTGG 
1921 + + + + + + 1980 

TACCCCTCAGAAATGACTGTAAGCGAGAGACGTTGTCGAACATCTCTAGCCGCGTTGACC 

Bcgl Mnll 
Bsrl Tthlllll | Nlalll | CviRI 

I M II I 

TATGGAACACAAACAAGGTTTCTGGGTTTCCTCCATGACGAACTTCCTGCATAAGACTGG 
1981 + + + + + + 2Q40 

ATACCTTGTGTTTGTTCCAAAGACCCAAAGGAGGTACTGCTTGAAGGACGTATTCTGACC 

Maell 
BsaXI 

BsaBI Hin4I 
Bcgl | Bpml Hpyl7 8III Mnll 

Bsrl | j CviJI Mnll | Cvi Ji | | B pml 

II I I I I il I | 



AGATGAAAATCGCAAAGGCTTCCGTCATACCTCTGGAGGCTACGTCATCGGTGGAAGTGC 

"_~ + + + **" + 

TCTACTTTTAGCGTTTCCGAAGGCAGTATGGAGACCTCCGATGCAGTAGCCACCTTCACG 



BsiHKAI Bfal 
Bspl286I BccI BsmAI I 

I III 

TCACACTCCTAAAGACGACCTATTTACCTTTGCGTTCTGCCATCTCTTTGCTAGAGACAA 
2101 + + + + + + 216Q 

AGTGTGAGGATTTCTGCTGGATAAATGGAAACGG^GACGGTAGAGAAACGATCTCTGTT 

Hpyl78III 
Bfal | 

Xbal| j MboII 
Cjel Ml Taal | cjel 



AGATTGTTTTATCGCTCACAACAACTCTAGAACCTACGGTGGAACTTTATTCTTCAAGCA 
+ + + + + + 

TCTAACAAAATAGCGAGTGTTGTTGAGATCTTGGATGCCACCTTGAAATAAGAAGTTCGT 



Hinfl 
Tfil 

Earl Hpyl88IX | 

Tthlllll Sapl MboII I 

i I ill 

CT CTC AT AC C CTACAAC C C CAAAAC T ATTTG AG ATT AG G AAG AG CAAAGTTTT CTG AATC 

2221 + + + + + + 2280 

GAGAGTATGGGATGTTGGGGTTTTGATAAACTCTAATCCTTCTCGTTTCAAAAGACTTAG 
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Sfcl 
Alul! 



A^ol 
Tsp509I 
Bfal 
Avrll | 
Apol BsaJI | 



CviJl) Tsp509I Styl| 



BslI 
Cjel 
BsaJI | 
Styi| 
cvi ji | I 
Bfal | | | 
I III 



Fokl CviJI 



AGCTATAGAAAAATTCCCTAGGGAAATTCCCCTAGCCTTGGATGTCCAAGTTTCGTTCAG 

2281 - + " + + + + + 2340 

TCGATATCTTTTTAAGGGATCCCTTTAAGGGGATCGGAACCTACAGGTTCAAAGCAAGTC 



Taal 

Hpyl88IX Cjel | 
I I I 



Hpyl8 8IX 
Hinf I 

Tfil 
Mnll | 
BsrDl CjePl | I 



2341 



CCATTCAGACAACCGTATGGAAACGCACTATACCTCATTGCCAGAATCCGAAGGTTCTTG 



-+ 2400 



GGTAAGTCTGTTGGCATACCTTTGCGTGATATGGAGTAACGGTCTTAGGCTTCCAAGAAC 



Alul 
CviJI 
CjePI | 



Bfal 

CviJI | Fokl 
Haelll | CjePI | 

II II II 
GAGCAACGAGTGTATAGCTGGTGGTATCGGCCTAGACCTTCCTTTTGTTCTTTCCAACCC 
2401 + + + - + + + 2460 

CTCGTTGCTCACATATCGACCACCATAGCCGGATCTGGAAGGAAAACAAGAAAGGTTGGG 

CjePI 
Mmel 
Mnll | 
RleAI j 

Hpyl7 8IIl| || Taql 
II II i 
ACATCCTCTTTTC^UVGACCTTCATTCCACAGATGAAAGTCGAAATGGTTTATGTATO^CA 
2461 + + + + + + 2520 

TGTAGGAGAAAAGTTCTGGAAGTAAGGTGTCTACTTTCAGCTTTACCAAATACATAGTGT 



BsaJI 
BstDSI 
CviJI | 
Haelll | 

Bed | j Fnu4HI 
Eael | j CviJI | 

MboII | Taql I I I Gciill j| Bbvl Tsel Msel 

i n i mi 

AAATAGCTTCTTCGAAAGCTCTAGTGATGGCCGTGGTTTTAGTATTGGAAGGCTGCTTAA 

2521 + + + + + + 2580 

TTTATCGAAGAAGCTTTCGAGATCACTACCGGCACCAAAATCATAACCTTCCGACGAATT 



Alul 
CviJI 



Bfal 
Alul 
CviJI 
Bcefl | 
NspV | j 
Taql | | 
I I I 
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Mnll 
Bcgl 
Hinf I 
Tfil 
Hpyl78III 
TaqI 
TaqI I 
RleAI | 
I I 



Apol 
Tsp509I 



CviRI 



Bsgl 
Hinf I 
Tfil 
Bcgl | 
HpylSSIXj j 
ECORV | j j 

II I 



2581 



CCTCTCGATTCCTGTGGGTGCGAAATTCGTGCAGGGGGATATCGGAGATTCCTACACCTA 



Hpyl88IX Maelll 
BseMII | BsaBI | 



Plel 
Sfcl| Hinf I 



GGAGAGCTAAGGACACCCACGCTTTAAGCACGTCCCCCTATAGCCTCTAAGGATGTGGAT 

Hinf I 
Tfil 
Hpyl78III | 
Dpnl | | 

Sau3AI |DdeI | | 

I I I I I II II || | 

TGATCTCTCAGGATTCTTTGTTTCCGATGTCTATCGTAACAATCCCCAATCTACAGCGAC 
2641 + + + + + + 

ACTAGAGAGTCCTAAGAAACAAAGGCTACAGATAGCATTGTTAGGGGTTAGATGTCGCTG 

Apol 
Tsp509I 
BstXI 
Hinf I 
Banll 



-+ 2640 



2700 



Bspl286I 
CviJI | 
Plel | | 



Acil Mnn 
Thai Hpyl7 8III BseMII | 



2701 



TCTTGTGATGAGCCCAGACTCTTGGAAAATTCGCGGTGGCAATCTTTCAAGACAGGCATT 



AGAACACTACTCGGGTCTGAGAACCTTTTAAGCGCCACCGTTAGAAAGTTCTGTCCGTAA 



-+ 2760 



Earl 
Sap I 
Hpyl88IX| 
Ban 1 1 
BsiHKAI 
Bspl286I 
MboII SacI 
AccI Muni | Alul | 

Ddel Maell | Tsp509I j CviJI 

I II I I I I 
TTTACTGAGGGGTAGCAACAACTACGTCTACAACTCCAATTGTGAGCTCTTCGGACATTA 
2761 + + + - + + + 2820 

AAATGACTCCCCATCGTTGTTGATGCAGATGTTGAGGTTAACACTCGAGAAGCCTGTAAT 
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Dpnl 
BstYI 
Sau3AI 
BsaJI | 
BstDSI J 
MboIl| I 
II I 



Alwl 



Hinf I 
Tfil 
Hpyl88IX 
Kpnl 
NlalV | 
Rsal | 
BanI | I 

I I I 



2821 



CGCTATGGAACTCCGTGGATCTTCAAGGAACTACAATGTAGATGTTGGTACCAAACTCCG 
I + + + + + 

GCGATACCTTGAGGCACCTAGAAGTTCCTTGATGTTACATCTACAACCATGGTTTGAGGC 



2880 



Hpyl78III 

Bfal| Bfal 
Xbal| I MboII | Bfal 

til III 
ATTCTAGATTGCTAAAACTCCCTAGTTCTTCTAGGGAGTTTTCTCATACTTTTAGGGAAA 



2881 



TAAGATCTAACGATTTTGAGGGATCAAGAAGATCCCTCAAAAGAGTATGAAAATCCCTTT 
Sspl 

I 

TATTTGCTAT 

2941 + 2950 

ATAAACGATA 



-+ 2940 
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J 

Figure 3: Nucleotide and deduced amino acid sequence of CPN100635 

agttaaagat gacaaaacag ctgtcaagaa tttttatctt gactctctga gttttctatt 60 

ttatatgacg caagtaagaa tttaataata aagtgggttt atg aaa teg caa ttt 115 

Met Lys Ser Gin Phe 
1 5 

tec tgg tta gtg etc tct teg aca ttg gca tgt ttt act agt tgt tec 163 
Ser Trp Leu Val Leu Ser Ser Thr Leu Ala Cys Phe Thr Ser Cys Ser 
10 15 20 

act gtt ttt get gca act get gaa aat ata ggc ccc tct gat age ttt 211 
Thr Val Phe Ala Ala Thr Ala Glu Asn He Gly Pro Ser Asp Ser Phe 
25 30 35 

gac gga agt act aac aca ggc ace tat act cct aaa aat acg act act 259 

Asp Gly Ser Thr Asn Thr Gly Thr Tyr Thr Pro Lys Asn Thr Thr Thr 

Gly Thr Tyr Thr Pro Lys Asn Thr Thr Thr 
40 45 50 

gga ata gac tat act ctg aca gga gat ata act ctg caa aac ctt ggg 307 

Gly He Asp Tyr Thr Leu Thr Gly Asp He Thr Leu Gin Asn Leu Gly 

Gly He Asp Tyr Thr Leu Thr Gly Asp He Thr Leu Gin Asn Leu Gly 
55 60 65 

gat teg gca get tta acg aag ggt tgt ttt tct gac act acg gaa tct 355 

Asp Ser Ala Ala Leu Thr Lys Gly Cys Phe Ser Asp Thr Thr Glu Ser 

Asp Ser Ala Ala Leu Thr Lys Gly Cys Phe Ser Asp Thr Thr Glu Ser 
70 75 80 85 

tta age ttt gee ggt aag ggg tac tea ctt tct ttt tta aat att aag 4 03 

Leu Ser Phe Ala Gly Lys Gly Tyr Ser Leu Ser Phe Leu Asn He Lys 

Leu Ser Phe Ala Gly Lys Gly Tyr Ser Leu Ser Phe Leu Asn He Lys 
90 95 100 

tct agt get gaa ggc gca gec ctt tct gtt aca act gat aaa aat ctg 451 

Ser Ser Ala Glu Gly Ala Ala Leu Ser Val Thr Thr Asp Lys Asn Leu 

Ser Ser Ala Glu Gly Ala Ala Leu Ser Val Thr Thr Asp Lys Asn Leu 
105 110 us 

teg eta aca gga ttt teg agt ctt act ttc tta gcg gee cca tea teg 499 
Ser Leu Thr Gly Phe Ser Ser Leu Thr Phe Leu Ala Ala Pro Ser Ser 
Ser Leu Thr Gly Phe Ser Ser Leu Thr Phe Leu Ala Ala Pro Ser Ser 
120 125 130 

gta ate aca acc ccc tea gga aaa ggt gca gtt aaa tgt gga ggg gac 547 

Val He Thr Thr Pro Ser Gly Lys Gly Ala Val Lys Cys Gly Gly Asp 

Val He Thr Thr Pro Ser Gly Lys Gly Ala Val Lys Cys Gly Gly Asp 

135 140 145 

ctt aca ttt gat aac aat gga act att tta ttt aaa caa gat tac tgt 595 

Leu Thr Phe Asp Asn Asn Gly Thr He Leu Phe Lys Gin Asp Ty r Cys 

Leu Thr Phe Asp Asn Asn Gly Thr He Leu Phe Lys Gin Asp Tyr Cys 

150 155 160 " ' 165 
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gag gaa aat ggc gga gcc 
Glu Glu Asn Gly Gly Ala 
Glu Glu Asn Gly Gly Ala 
170 

age acg gga teg att tct 
Ser Thr Gly Ser lie Ser 
Ser Thr Gly Ser lie Ser 
185 

aaa aaa ggt ggg get att 
Lys Lys Gly Gly Ala lie 
Lys Lys Gly Gly Ala lie 
200 

aat acg get cct acc etc 
Asn Thr Ala Pro Thr Leu 
Asn Thr Ala Pro Thr Leu 
215 



att tct acc aag aat ctt 
lie Ser Thr Lys Asn Leu 
lie Ser Thr Lys Asn Leu 
175 

ttt gaa ggg aat aaa teg 
Phe Glu Gly Asn Lys Ser 
Phe Glu Gly Asn Lys Ser 
190 

tgt get act ggt act gta 
Cys Ala Thr Gly Thr Val 
Cys Ala Thr Gly Thr Val 
205 

ttc teg aac aat att get 
Phe Ser Asn Asn lie Ala 
Phe Ser Asn Asn lie Ala 
220 225 



tct ttg aaa aac 643 
Ser Leu Lys Asn 
Ser Leu Lys Asn 
180 

age gca aca ggg 691 
Ser Ala Thr Gly 
Ser Ala Thr Gly 
195 

gat att aca aat 739 
Asp lie Thr Asn 
Asp lie Thr Asn 
210 

gaa get gca ggt 787 
Glu Ala Ala Gly 
Glu Ala Ala Gly 



gga get ata aat 
Gly Ala He Asn 
Gly Ala He Asn 
230 



age aca gga aac 
Ser Thr Gly Asn 
Ser Thr Gly Asn 
235 



tgt aca att aca 
Cys Thr He Thr 
Cys Thr He Thr 
240 



ggg aat acg tct 835 
Gly Asn Thr Ser 
Gly Asn Thr Ser 
245 



ctt gta ttt tct gaa aat agt gtg aca gcg acc gca gga aat gga gga 883 
Leu Val Phe Ser Glu Asn Ser Val Thr Ala Thr Ala Gly Asn Gly Gly 
Leu Val Phe Ser Glu Asn Ser Val Thr Ala Thr Ala Gly Asn Gly Gly 
250 255 260 



get ctt tct gga gat gcc gat gtt acc ata tct ggg aat cag agt gta 931 

Ala Leu Ser Gly Asp Ala Asp Val Thr He Ser Gly Asn Gin Ser Val 

Ala Leu Ser Gly Asp Ala Asp Val Thr He Ser Gly Asn Gin Ser Val 

265 270 275 



act ttc tea gga aac caa get gta get aat ggc gga gcc att tat get 979 
Thr Phe Ser Gly Asn Gin Ala Val Ala Asn Gly Gly Ala He Tyr Ala 
Thr Phe Ser Gly Asn Gin Ala Val Ala Asn Gly Gly Ala He Tyr Ala 
280 285 290 



aag aag ctt aca ctg get tec ggg ggg ggg ggg ggg aat ccc ttt tct 1027 

Lys Lys Leu Thr Leu Ala Ser Gly Gly Gly Gly Gly Asn Pro Phe Ser 

Lys Lys Leu Thr Leu Ala Ser Gly Gly Gly Gly Gly Asn Pro Phe Ser 

295 300 " 305 



aac aat ata gtc 
Asn Asn He Val 
Asn Asn He Val 
310 

ata ctg gca get 
lie Leu Ala Ala 
He Leu Ala Ala 



caa ggt acc act 
Gin Gly Thr Thr 
Gin Gly Thr Thr 
315 

gga gag tgt agt 
Gly Glu Cys Ser 
Gly Glu Cys Ser 
330 



gca ggt aat ggt 
Ala Gly Asn aly 
Ala Gly Asn Gly 
320 

ctt ttc age gaa 
Leu Phe Ser Glu 
Leu Phe Ser Glu 
335 



gga gcc att tct 1075 
Gly Ala He Ser 
Gly Ala He Ser 
325 

gca ggg gac cat 112 3 
Ala Gly Asp His 
Ala Gly Asp His 
340 
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tac ctt aat ggg aat gcc att gtt gca act aca cca caa act aca aaa 1171 

Ty,.* Leu Asn Gly Asn Ala lie Val Ala Thr Thr Pro Gin Thr Thr Lys 

Tyr Leu Asn Gly Asn Ala lie Val Ala Thr Thr Pro Gin Thr Thr Lys 

34S 350 355 

aga aat tct att gac ata gga tct act ggc aaa gat cac gaa tta cgt 1219 

Arg Asn Ser lie Asp He Gly Ser Thr Gly Lys Asp His Glu Leu Arg 

Arg Asn Ser He Asp He Gly Ser Thr Gly Lys Asp His Glu Leu Arg 
360 365 370 

gca ata tct ggg cat age ate ttt ttc tac gat ccg att act get aat 1267 

Ala He Ser Gly His Ser He Phe Phe Tyr Asp Pro He Thr Ala Asn 

Ala He Ser Gly His Ser He Phe Phe Tyr Asp Pro lie Thr Ala Asn 

375 380 385 

acg get gcg gat tct aca gat act tta aat etc aat aag get gat gca 1315 
Thr Ala Ala Asp Ser Thr Asp Thr Leu Asn Leu Asn Lys Ala Asp Ala 
Thr Ala Ala Asp Ser Thr Asp Thr Leu Asn Leu Asn Lys Ala Asp Ala 
390 395 400 405 

ggt aat agt aca gat tat agt ggg teg att gtt ttt tct ggt gaa aag 1363 

Gly Asn Ser Thr Asp Tyr Ser Gly Ser He Val Phe Ser Gly Glu Lys 

Gly Asn Ser Thr Asp Tyr Ser Gly Ser He Val Phe Ser Gly Glu Lys 

410 415 420 

etc tct gaa gat gaa gca aaa gtt gca gac aac etc act tct acg ctg 1411 

Leu Ser Glu Asp Glu Ala Lys Val Ala Asp Asn Leu Thr Ser Thr Leu 

Leu Ser Glu Asp Glu Ala Lys Val Ala Asp Asn Leu Thr Ser Thr Leu 
425 430 435 

aag cag cct gta act eta act gca gga aat tta gta ctt aaa cgt ggt 14 59 
Lys Gin Pro Val Thr Leu Thr Ala Gly Asn Leu Val Leu Lys Arg Gly 
Lys Gin Pro Val Thr Leu Thr Ala Gly Asn Leu Val Leu Lys Arg Gly 
440 445 450 

gtc act etc gat acg aaa ggc ttt act cag ace gcg ggt tec tct gtt 1507 

Val Thr Leu Asp Thr Lys Gly Phe Thr Gin Thr Ala Gly Ser Ser Val 

Val Thr Leu Asp Thr Lys Gly Phe Thr Gin Thr Ala Gly Ser Ser Val 
455 460 465 

att atg gat gcg ggc aca acg tta aaa gca agt aca gag gag gtc act 1555 

lie Met Asp Ala Gly Thr Thr Leu Lys Ala Ser Thr Glu Glu Val Thr 

lie Met Asp Ala Gly Thr Thr Leu Lys Ala Ser Thr Glu Glu Val Thr 

470 475 480 485 

tta aca ggt ctt tec att cct gta gac tct tta ggc gag ggt aag aaa 1603 
Leu Thr Gly Leu Ser He Pro Val Asp Ser Leu Gly Glu Gly Lys Lys 
.Leu Thr Gly Leu Ser lie Pro Val Asp Ser Leu Gly Glu Gly Lys Lys 
490 495 " 500 

gtt gta att get get tct gca gca agt aaa aat c:a gcc ctt age ggt 1651 

Val Val lie Ala Ala Ser Ala Ala Ser Lys Asn Val Ala Leu Ser Gly 

Val Val lie Ala Ala Ser Ala Ala Ser Lys Asn Val Ala Leu Ser Glv 

505 510 515 
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ccg att ctt ctt ttg gat aac caa ggg aat get tat gaa aat cpz gac 1699 

Pro lie Leu Leu Leu Asp Asn Gin Gly Asn Ala Tyr Glu Asn His Asp 

Pro lie Leu Leu Leu Asp Asn Gin Gly Asn Ala Tyr Glu Asn His Asp 

520 525 530 



tta gga aaa act 
Leu Gly Lys Thr 
Leu Gly Lys Thr 
535 

act gca aca act 
Thr Ala Thr Thr 
Thr Ala Thr Thr 
550 

acg cac tat ggg 
Thr His Tyr Gly 
Thr His Tyr Gly 



caa gac ttt tea 
Gin Asp Phe Ser 
Gin Asp Phe Ser 
540 

aca gat gtt cca 
Thr Asp Val Pro 
Thr Asp Val Pro 
555 

tat caa ggt act 
Tyr Gin Gly Thr 
Tyr Gin Gly Thr 
570 



ttt gtg cag etc 
Phe Val Gin Leu 
Phe Val Gin Leu 
545 

gcg gtt cct aca 
Ala Val Pro Thr 
Ala Val Pro Thr 
560 

tgg ^tg act 

Trp Gly Met Thr 
Trp Gly Met Thr 
575 



tct get ctg ggt 1747 
Ser Ala Leu Gly 
Ser Ala Leu Gly 



gta gca act cct 1795 
Val Ala Thr Pro 
Val Ala Thr Pro 
565 

tgg gtt gat gat 1843 
Trp Val Asp Asp 
Trp Val Asp Asp 
580 



ace gca age act cca aag 
Thr Ala Ser Thr Pro Lys 
Thr Ala Ser Thr Pro Lys 
585 

aca ggc tac ctt ccg aat 
Thr Gly Tyr Leu Pro Asn 
Thr Gly Tyr Leu Pro Asn 
600 

age ctt tgg gga tct ttt 
Ser Leu Trp Gly Ser Phe 
Ser Leu Trp Gly Ser Phe 
615 



act aag aca gcg aca tta 
Thr Lys Thr Ala Thr Leu 
Thr Lys Thr Ala Thr Leu 
590 

cct gag cgt caa gga cct 
Pro Glu Arg Gin Gly Pro 
Pro Glu Arg Gin Gly Pro 
605 

tea gac ate caa gcg att 
Ser Asp lie Gin Ala lie 
Ser Asp lie Gin Ala lie 
620 625 



get tgg ace aat 1891 
Ala Trp Thr Asn 
Ala Trp Thr Asn 
595 

tta gtt cct aat 1939 
Leu Val Pro Asn 
Leu Val Pro Asn 
610 

caa ggt gtc ata 1987 
Gin Gly Val He 
Gin Gly Val He 



gag aga agt get ttg act ctt tgt tea gat cga ggc ttc tgg get gcg 203 5 

Glu Arg Ser Ala Leu Thr Leu Cys Ser Asp Arg Gly Phe Trp Ala Ala 

Glu Arg Ser Ala Leu Thr Leu Cys Ser Asp Arg Gly Phe Trp Ala Ala 
630 635 640 645 

gga gtc gee aat ttc tta gat aaa gat aag aaa ggg gaa aaa cgc aaa 2 083 

Gly Val Ala Asn Phe Leu Asp Lys Asp Lys Lys Gly Glu Lys Arg Lys 

Gly Val Ala Asn Phe Leu Asp Lys Asp Lys Lys Gly Glu Lys Arg Lys 

650 655 660 

tac cgt cat aaa tct ggt gga tat get ate gga ggc gca gcg caa act 2131 

Tyr Arg His Lys Ser Gly Gly Tyr Ala He Gly Gly Ala Ala Gin Thr 

Tyr Arg Kis Lys Ser Gly Gly Tyr Ala He Gly Gly Ala Ala Gin Thr 

665 670 * 675 

tgt tct caa aac tta att age rtt gec ttt zgc caa etc ttt ggt age 2179 

Cys Ser Glu Asn Leu He Ser Phe Ala Phe Cys Gin Leu Phe Gly Ser 

Cys Ser Glu Asn Leu He Ser Phe Ala Phe Cys Gin Leu Phe Gly Ser 
680 685 690 
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gat aaa gat ttc tta gtc get aaa aat cat act gat acc tat gca gga 

Asp Lys Asp Phe Leu Val Ala Lys Asn His Thr Asp Thr Tyr Ala Gly 

Asp Lys Asp Phe Leu Val Ala Lys Asn His Thr Asp Thr Tyr Ala Gly 
695 700 70S 



2227 



gec ttc tat ate 
Ala Phe Tyr lie 
Ala Phe Tyr lie 
710 

etc tta gat aaa 
Leu Leu Asp Lys 
Leu Leu Asp Lys 



caa cac att aca 
Gin His lie Thr 
Gin His lie Thr 
715 

ctt cct ggc tct 
Leu Pro Gly Ser 
Leu Pro Gly Ser 
730 



gaa tgt agt ggg 
Glu Cys Ser Gly 
Glu Cys Ser Gly 
720 

fc 99 agt cat aaa 
Trp Ser His Lys 
Trp Ser His Lys 
735 



ttc ata ggt tgt 2275 
Phe lie Gly Cys 
Phe lie Gly Cys 
725 

ccc etc gtt tta 2323 
Pro Leu Val Leu 
Pro Leu Val Leu 
740 



gaa ggg cag etc get tat age cac gtc agt aat gat ctg aag aca aag 2371 

Glu Gly Gin Leu Ala Tyr Ser His Val Ser Asn Asp Leu Lys Thr Lys 

Glu Gly Gin Leu Ala Tyr Ser His Val Ser Asn Asp Leu Lys Thr Lys 
745 750 755 

tat act gcg tat cct gag gtg aaa ggt tct tgg ggg aat aat get ttt 2419 
Tyr Thr Ala Tyr Pro Glu Val Lys Gly Ser Trp Gly Asn Asn Ala Phe 
Tyr Thr Ala Tyr Pro Glu Val Lys Gly Ser Trp Gly Asn Asn Ala Phe 
760 765 770 

aac atg atg ttg gga get tct tct cat tct tat cct gaa tac ctg cat 2467 

Asn Met Met Leu Gly Ala Ser Ser His Ser Tyr Pro Glu Tyr Leu His 

Asn Met Met Leu Gly Ala Ser Ser His Ser Tyr Pro Glu Tyr Leu His 
775 780 785 

tgt ttt gat acc tat get cca tac ate aaa ctg aat ctg acc tat ata 2515 

Cys Phe Asp Thr Tyr Ala Pro Tyr lie Lys Leu Asn Leu Thr Tyr lie 

Cys Phe Asp Thr Tyr Ala Pro Tyr lie Lys Leu Asn Leu Thr Tyr lie 

790 795 800 80S 

cgt cag gac age ttc teg gag aaa ggt aca gaa gga aga tct ttt gat 2563 

Arg Gin Asp Ser Phe Ser Glu Lys Gly Thr Glu Gly Arg Ser Phe Asp 

Arg Gin Asp Ser Phe Ser Glu Lys Gly Thr Glu Gly Arg Ser Phe Asp 
810 815 820 

gac age aac etc ttc aat tta tct ttg cct ata egg gtg aag ttt gag 2611 

Asp Ser Asn Leu Phe Asn Leu Ser Leu Pro lie Gly Val Lys Phe Glu 

Asp Ser Asn Leu Phe Asn Leu Ser Leu Pro lie Gly Val Lys Phe Glu 

825 830 835 

aag ttc tct gat tgt aat gac ttt tct tat gat ccg act tta tec cat 2659 

Lys Phe Ser Asp Cys Asn Asp Phe Ser Tyr Asp Leu Thr Leu Ser Tvr 

Lys Phe Ser Asp Cys Asn Asp Phe Ser Tyr Asp Leu Thr Leu Ser Tyr 

840 845 850 

gtt cc: gat cz:z ate cgc aac gac ccc aaa cgc acc aca cca etc gca 2707 

Val Pro Asp Leu lie Arg Asn Asp Pro Lys Cys Thr Thr Ala Leu Val 

Val Pro Asp Leu lie Arg Asn Asp Pro Lys Cys Thr Thr Ala Leu Val 
855 860 £55 



SUBSTITUTE SHEET (RULE 26) 



WO 00/32794 



Fig. 3 (con't) 



Title: CHLAMYDIA ANTIGENS AND 
CORRESPONDING DNA A 
FRAGMENTS AND USES THER^P 
Inventor(s): Andrew D. MURDIN et al. 
DOCKET NO.: 032931-0252 



PCT/CA99/01147 



ate age gga gec tct tgg, gaa act tat gec aat aac tta gca cga cag 2755 

He Ser Gly Ala Ser 'Jrp Glu Thr Tyr Ala Asn Asn Leu Ala Arg Gin 

He Ser Gly Ala Ser Trp Glu Thr Tyr Ala Asn Asn Leu Ala Arg Gin 

870 875 880 885 

gec ttg caa gtg cgt gca ggc agt cac tac gec ttc tct cct atg ttt 2803 

Ala Leu Gin Val Arg Ala Gly Ser His Tyr Ala Phe Ser Pro Met Phe 

Ala Leu Gin Val Arg Ala Gly Ser His Tyr Ala Phe Ser Pro Met Phe 
890 895 900 

gaa gtg etc ggc cag ttt gtc ttt gaa gtt cgt gga tec tea egg att 2851 

Glu Val Leu Gly Gin Phe Val Phe Glu Val Arg Gly Ser Ser Arg He 

Glu Val Leu Gly Gin Phe Val Phe Glu Val Arg Gly Ser Ser Arg He 
905 910 915 

tat aat gta gat ctt ggg ggt aag ttc caa ttc taggagegtc tctcatgtct 2904 
Tyr Asn Val Asp Leu Gly Gly Lys Phe Gin Phe 
Tyr Asn Val Asp Leu Gly Gly Lys Phe Gin Phe 
920 925 



cagaaattct gagagagatc gcatttagga ttttcttaaa cacgac 



2950 
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Figure 4: Restriction enzyme analysis of CPN1 00635 



Msel 



Apol 
Tsp509I 
Hpyl78III 
Alul 
CviJI 
MspAlI 
PvuII 



Ddel 
Hpyl88IX 
Hinfl 
Hpyl78III | 
BseMII | j 
Plel | » j 



AGTTAAAGATGACAAAACAGCTGTCAAGAATTTTTATCTTGACTCTCTGAGTTTTCTATT 



TCAATTTCTACTGTTTTGTCGACAGTTCTTAAAAATAGAACTGAGAGACTCAAAAGATAA 



■ + 60 



Apol 
Tsp509I 
Hgal | Msel 
I I I 



Tsp509I 



ScrFI 
EcoRII | 



61 



TTATATGACGCAAGTAAGAATTTAATAATAAAGTGGGTTTATGAAATCGCAATTTTCCTG 
+ + + + + + 

AATATACTGCGTTCATTCTTAAATTATTATTTCACCCAAATACTTTAGCGTTAAAAGGAC 



BsiHKAI 

Bspl286I Earl Nlalll Bfal 

MboII | Taql | MslI Nspl Spel\ 



121 



181 



Mwol 
CviRI 
Pnu4HI | 
TspRI | j 
Bbvl Taal iTsel 

> IM I I II | | | III 

GTTAGTGCTCTCTTCGACATTGGGATGTTTTAC 

+ + + + + + 

CAATCACGAGAGAAGCTGTAACCGTACAAAATGATCAACAAGGTGACAAAAACGACGTTG 

Hpyl88IX 
NlalV 
CviJI | 

Haelllj | Mnll Rsal 
EcoOl09I|| j Alul | Seal 
Sau96I|| j CviJI j TatI j 

III 

TGCTOAAAATATAGGCCCCTCTGATAGCTTO^ 

" * + + + + + 240 

ACGACTTTTATATCCGGGGAGACTATCGAAACTGCCTTCATGATTGTGTCCGTGGATATG 



NlalV 
BanI | BsaXI 



Cjel 



Bsrl 



Cjel 
Hpyl8 8IX 



CviRI 



241 



TCCTAAAAATACGACTACTGGAATAGACTATACTCTGACAGGAGATATAACTCTGCAAAA 

" * + + - - - + + + 

AGGATTTTTATGCTGATGACCTTATCTGATATGAGACTGTCCTCTATATTGAGACGTTTT 



300 
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BsaJI 
Sty I 



Msel 
Alul 
CviJI 
Hinfl Fnu4HI | 
Tfil Tsel| | 
I II I 



Bbvl 



Hpyl8 8IX 



Alul 
CviJI 

Hinfl Hindlll | 
Tfil Msel | | 
I I I 



301 



CCTTGGGGATTCGGCAGCTTTAACGAAGGGTTGTTTTTCTGACACTACGGAATCTTTAAG 
+ + + + H + 

GGAACCCCTAAGCCGTCGAAATTGCTTCCCAACAAAAAGACTGTGATGCCTTAGAAATTC 



360 



361 



Fnu4HI 
Hhal | 

Mspl Dral Msel Tselj 

BsrFl| BslI Rsal ' Msel I Sspl j Bfal Mwol II 

M i i ii i i i i !i 

CTTTG C CGGTAAGGGGTACTCACTTTCTTTTTTAAAT ATTAAGT CT AGTG CTGAAGG CG C 
+ + + + + ^ 

GAAACGGCCATTCCCC^TGAGTGAAAGAAAAAATTTATAATTCAGATCACGACTTCCGCG 



420 



Hinfl 
Taql | 



Eco57I 
Bbvl | 

CviJI MaeIIl|| BsaBI Taql | Plel 

i in i i i i 

AGCCCTTTCTGTTACAACTGATAAAAATCTGTCGCTAACAGGATTTTCGAGTCTTACTTT 
421 + + + + + + 480 

TCGGGAAAGACAATGTTGACTATTTTTAGACAGCGATTGTCCTAAAAGCTCAGAATGAAA 

Bed 
NlalV 
CviJI 
Haelll 
Sau96I 
Fnu4HI | 

Taulj | | | Hpyl78III 
Acil| j | | | Cjel | 

c IJ eI I I I I I I BSU36I | j 

Ddel| Mil l fi sll Ddel | | 

II II 



Mnll 
Msel 
BseMII | 
CviRI | j 
Mnll | | j 
I III 



CTTAGCGGCCCCATCATCGGTAATCACAACCCCCTCAGGAAAAGGTGCAGTTAAATGTGG 
481 + + + + + + 540 

GAATCGCCGGGGTAGTAGCCATTAGTGTTGGGGGAGTCCTTTTCCACGTCAATTTACACC 



Bsgl 
Dpnl 
BstYI | 
Sau3AI 



Dral Taal 
Alwl Msel I Mnll I 

i i i nil 

AGGGGATCTTACATTTGATAACAATGGAACTATTTTATTTAAACAAGATTACTGTGAGGA 



541 



-+ 600 



TCCCCTAGAATGTAAACTATTGTTACCTTGATAAAATAAATTTGTTCTAATGACACTCCT 
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CviJI 
NlaIV| 
Ecil | I 
Acil | I I 
I I II 



Hinf I 
Tf il 



Sthl32I 



Clal 
TaqI 
Dpnl| 
Sau3AI | I 
BscGI | j | Alwl 
I II 



601 



AAATGGCGGAGCCATTTCTACCAAGAATCTTTCTTTGAAAAACAGCACGGGATCGATTTC 
+ + + + + + 

TTTACCGCCTCGGTAAAGATGGTTCTTAGAAAGAAACTTTTTGTCGTGCCCTAGCTAAAG 



660 



Hhal 
CjePl| 
TaqI | | 



CjePI 



CviJI Mwol 



661 



TTTTG AAGGGAAT AAATCG AGCG CAACAGGGAAAAAAGGTGGGG CTATTTGTGCTACTGG 



AAAACTTCCCTTATTTAGCTCGCGTTGTCCCTTTTTTCCACCCCGATAAACACGATGACC 



-+ 720 



BspMI 
Sspl 
Bbvl 
Mnll 



Taal Earl 
Sfcl| MboII Hpyl78III 

Bsrl | | NlaIV| TaqI 

Rsal | j CviJI | | Beef I I 

I II III || 

TACrraTAGATATTACAAATAATACGGCTCCTACCCTCTTCTCGAACAATATTGCTGAAG 
721 + + + + + + 780 

ATGACATCTATAATGTTTATTATGCCGAGGATGGGAGAAGAGCTTGTTATAACGACTTCG 



Pnu4HI 
Sfcl 
Alul| 
CviJI | 
Mwol | 
Tselj 
II 



Eco57I 
Alul 
CviJI 
Mwol 
Aarl | 
Pstl| j 
CviRl | | j 



Tsp509I 
Rsal 
BsrGI | 
Taal | 
Tat I | 



BsmAI 
Maell BsmBI 



I 



TGCAGGTGGAGCTATAAATAGCACAGGAAACTGTACAATTACAGGGAATACGTCTCTTGT 
781 + + + + + + 840 

ACGTCCACCTCGATATTTATCGTGTCCTTTGACATGTTAATGTCCCTTATGCAGAGAACA 



Hpyl88IX 



Maelll 
Tsp4 5I 



Acil 
BsiEI Mnll 



BseRI 
Hpyl78III 
Banll 
BsiHKAI 
Bspl286I 
SacI 
SfaNI 
Alul | 
CviJI j ' j icjel 



ATTTTCTGAAAATAGTGTGACAGCGACCGCAGGAAATGGAGGAGCTCTTTCTGGAGATGC 
841 + + + + + + 900 

TAAAAGACTTTTATCACACTGTCGCTGGCGTCCTTTACCTCCTCGAGAAAGACCTCTACG 
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Mae I II- 



Maelll 
HpyX8 8IX 
Hinfl | 
BslI | | 
Bpml |TfiI I 
I I I I 



Hpyl78III 
Cjel Ddel | 

I I I 



BseMII 
SfcX 
Alul| Alul 
CviJI J CviJI 
II 



901 



CGATGTTACCATATCTGGGAATCAGAGTGTAACTTTCTCAGGAAACCAAGCTGTAGCTAA 



GCTACAATGGTATAGACCCTTAGTCTCACATTGAAAGAGTCCTTTGGTTCGACATCGATT 



- + 960 



Neil 
ScrFI 
BsaJI 
Mspl 
BsrI 
TspRI 
CviJI 



Sthl32I 
Alul 

CviJI CviJI 
NlaIV| Hindi I I | 

Ecil | | Ddel | | 

Acil| j | Mwol | | | 



BslI 



Hinfl 
Tfil 



TGGCGGAGCCATTTATGCTAAGAAGCTTACACTGGCTTCCGGGGGGGGGGGGGGGAATCC 
961 + + + + + + 1020 

ACCGCCTCGGTAAATACGATTCTTCGAATGTGACCGAAGGCCCCCCCCCCCCCCCTTAGG 



PstI 
TspRI 
CviRI 
HgiEII 
BtsI 
Sfcl 
Kpnl 
MlalV 



Rsal 
BanI 
BspMI 
BsaJI | 
Styl | 
Cjel| | 



CviJI 
NlalVl 



Xcml 
Cjel | 
I I 



1021 



CTTTTCTAACAATATAGTCCAAGGTACCACTGCAGGTAATGGTGGAGCCATTTCTATACT 



GAAAAGATTGTTATATCAGGTTCCATGGTGACGTCCATTACCACCTCGGTAAAGATATGA 



■+ 1080 
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Alul 
CviJI 
MspAlI 
PvuII 
Bsrl | 
Fnu4HI | 
Tsel| | 
II I 



Bbvl 



Bpml 



NlalV 
Avail] 
Sau96I| 



BsmFI 
Msel | 
Xcml | j 
I I 



1081 



GGCAGCTGGAGAGTGTAGTCTTTTCAGCGAAGCAGGGGACCATTACCTTAATGGGAATGC 



■+ 1140 



CCGTCGACCTCTCACATCAGAAAAGTCGCTTCGTCCCCTGGTAATGGAATTACCCTTACG 

Dpnl 
BstYI 

CviRI Sau3AI 
Mwol | Apol HaelV | 

Bsml | | Tsp509I Hin4I 

III I II 



Bcgl 
Alwl| 

II 



1141 



CATTGTTGCAACTACACCACAAACTACAAAAAGAAATTCTATTGACATAGGATCTACTGG 



•+ 1200 



GTAACAACGTTGATGTGGTGTTTGATGTTTTTCTTTAAGATAACTGTATCCTAGATGACC 



1201 



Hpyl88IX 
Dpnl 
Sau3AI 

CviRI Pf 111081 | 

BsaAI j SfaNI | | 

Maell | j Bcgl Alwl I II Bbvl 

ii i i nil. 

CAAAGATCACGAATTACGTGCAATATCTGGGCATAGCATCTTTTTCTACGATCCGATTAC 
H + + + + + 

GTTTCTAGTGCTTAATGCACGTTATAGACCCGTATCGTAGAAAAAGATGCTAGGCTAATG 



Tsp509I 
Hpyl78III 
Dpnl | 
Sau3AI | j 
Bsrl | | | 



1260 



Hinfl 
Tfil 
Acil 
Fnu4HI 
CviJI | 
Tsel| 
Mwol | j 

I II 



CviJI 

Beef I Dral BspMI | 

Sfcl | Msel | SfaNI | j CviRI 
II II III 



1261 



TGCTAATACGGCTGCGGATTCTACAGATACTTTAAATCTCAATAAGGCTGATGCAGGTAA 



- + 1320 



ACGATTATGCCGACGCCTAAGATGTCTATGJ\AATTTAGAGTTATTCCGACTACGTCCATT 



Hpyl88IX 
HphI | 

Rsal Taqi Alul | | 

Tat I | simi | CviJI 

I I M III 
TAGTACAGATTATAGTGGGTCGATTGTTTTTTCTGGTGAAAAGCTCTCTGAAGATGAAGC 
1321 + + + + + + 1380 

ATCA.TGTCTAATATCACCCAGCTAACAAAAAAGACCACTTTTCGAGAGACTTCTACTTCG 
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Eco57I 
CviRI 
BstAPI | 
Mwol | 
MboII I 



Mnll 



Maelll 
CviJI 
Pnu4HI | 
Mwol | j 
Tselj j 
I 



Apol 
Tsp509I 
PstI 
CviRI 
Sfcl | 
Eco57l| j 
Bbvl | | | 

M I 



1381 



AAAAGTTGCAGACAACCTCACTTCTACGCTGAAGCAGCCTGTAACTCTAACTGCAGGAAA 

+ + + + + + 1440 

TTTTCAACGTCTGTTGGAGTGAAGATGCGACTTCGTCGGACATTGAGATTGACGTCCTTT 



BseMIl 
NlalV 
SacII 
Acil 
MspAlI | 
Thai | 
Acil 
BsaJI 
BstDSI 
Hpyl88IX 

I Faul | 
| Hpyl7 8III Ddel| | 
|MaeIII | Sthl32l|| 
|Tsp45I TaqI CviJI | j j 

II I I III 
TTOAGTACTTAAACGTGGTGTCACTCTCGATACGAAAGGC 

1441 + + + + + + 150Q 

AAATCATGAATTTGCACCACAGTGAGAGCTATGCTTTCCGAAATGAGTCTGGCGCCCAAG 



Mae 1 1 
Msel | 
Rsal | | 
Seal | 
Tat I | | 

I I I 



Msel 



Faul 
Mnll 
Sthl32l| 
Sf aNI | j 



Acil 
Fokl 
Mae 1 1 
Bspl286I 
Bmgl 
BseSI 
Cac8I | 
Acil | j 



Mnll 
Rsal 
Tat I | 
Mnll | j 



Maelll 
Tsp45I 



BseRI 
Msel t 



1501 



CTCTGTTATTATGGATGCGGGCACAACGTTAAAAGCAAGTACAGAGGAGGTCACTTTAAC 



GAGACAATAATACCTACGCCCGTGTTGCAATTTTCGTTCATGTCTCCTCCAGTGAAATTG 



- + 1560 



Hinfl 
AccI | 
Sfcl | j 
Plel| j j Mnll 



Fnu4HI Sfcl 
Bbvl Tsp509I Tsel| Mwol | 



AGGTCTTTCCATTCCTGTAGACTCITTAGGCGAGGGTAAGAAAGTTGTAATTGCTGCTTC 
1561 + + + - + + + 162 o 

TCCAGAAAGGTAAGGACATCTGAGAAATCCGCTCCCATTCTTTCAACATTAACGACGAAG 
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Hinfl 

PstI Tfil 
Fnu4Hl| Hpyl88IX 
CviRl|| Avail | 

Mwol | j Ddel MboII j 

Tseljj Bbvl CviJI | Sau96I | 

III I II II 

TGCAGCAAGTAAAAATGTAGCCCTTAGTGGTCCGATTCTTCTTTTGGATAACCAAGGGAA 



BsaJI 
Styl 



1621 



ACGTCGTTCATTTTTACATCGGGAATCACCAGGCTAAGAAGAAAACCTATTGGTTCCCTT 



-+ 1680 



Ddel 



1681 



Alul 
CvitJI 
Fnu4HI | 

Hpyl78III CviRl| | 

Smll | Tsel 

I I I II III 

TGCTTATGAAAATCACGACTTAGGAAAAACT^^ 



Bce83I 
BsmI Hpyl78III 



I 



ACGAATACTTTTAGTGCTGAATCCTTTTTGAGTTCTGAAAAGTAAACACGTCGAGAGACG 



-+ 1740 



SfCl 
NlalV | 
Acil | j Cjel 
MspAlI j |TaaI | 



CviRI 
Acelll Bsgl | 
Bbvl | Rsal| j Sfcl 
II III I 

TCTGGGTACTGCAACAACTACAG ATGTTC CAG CGG TTCCTACAGTAG CAACTCCTACGCA 



1741 



1801 



AGACCCATGACGTTGTTGATGTCTACAAGGTCGCCAAGGATGTCATCGTTGAGGATGCGT 

Cjel Cac8l 
Rsal | Acil I 

ii ii 

CTATGGGTATCAAGGTACTTGGGGAATGACTTGGGTTG ATGATACCG CAAGC ACT CCAAA 



- + 1800 



GATACCCATAG1TCCATGAACCCCTTACTGAACCCAACTACTATGGCGTTCGTGAGGTTT 



- + 1860 



Tthlllll 
Ddel I 



Avail 
Sau96I 
Alul | 
CviJI j 
Mwol | j 



I I I 



BpulOI 
Ddel 
Hpyl7 8III 
Hinfl 
Tfil 
Hgal 



Hpyl88IX 
BseMII I 



CviJI 
I 



1861 



GACTAAGACAGCGACATTAGCTTGGACCAATACAGGCTACCTTCCGAATCCTGAGCGTCA 
+ + + - + + + 

CTGATTCTGTCGCTGTAATCGAACCTGGTTATGTCCGATGGAAGGCTTAGGACTCGCAGT 



1920 
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Dpni 
Bstyi | 

Fokl |Hpyl88IX 
CviJI Sau3AI Alwl I 
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Hinf I 
Tfil 



AGGACCTTTAGTTCCTAATAGCCTTTGGGGATCTTTTTCAGACATCCAAGCGATTCAAGG 
1921 + + + + + + 198O 

TCCTGGAAATCAAGGATTATCGGAAACCCCTAGAAAAAGTCTGTAGGTTCGCTAAGTTCC 



Mwol 
Hinf I 
Bbvl Acil 
TaqI Fnu4HI 
Dpni | CviJI | 

Sau3AI | I Paul | j 

Hpyl88IX| | j Sthl32l| j j 
Mnll|| j j CviJI | Tsel 

I I Ml II I II || 
TGTCATAGAGAGAAGTGCTTTGACTCTTTGTTCAGATCGAGGCTTCTGGGCTGCGGGAGT 
1981 + + + + + : + 2040 

ACAGTATCTCTCTTCACGAAACTGAGAAACAAGTCTAGCTCCGAAGACCCGACGCCCTCA 



Plel Hinf I 



Plel 
Tsp509l| Ddel 



Taal 



CGCCAATTTCTTAGATAAAGATAAGAAAGGGGAAAAACGCAAATACCGTCATAAATCTGG 
2041 + + + + + + 2100 

GCGGTTAAAGAATCTATTTCTATTCTTTCCCCTTTTTGCGTTTATGGCAGTATTTAGACC 



Hpyl88IX 

Mnll | Fnu4HI 

HaeIV| j CviRl| 
Hin4l| j Tsel |HhaI 

II I II I 

TGGATATGCTATCGGAGGTGCAGCGCAAACTTGTTCTGAAAACTTAATTAGCTTTGCCTT 

2101 + + + + + + 2160 

ACCTATACGATAGCCTCCACGTCGCGTTTGAACAAGACTTTTGAATTAATCGAAACGGAA 



Tsp509I 
Bsgl | 
Hpyl88IX | | Alul 

Bbvl | j Mselj CviJI 



BstXI Ddel 

I I 

TTGCCAACTCTTTGGTAGCGATAAAGATTTCTTAGTCGCTAAAAATCATACTGATACCTA 

2161 + + + + + + 2220 

AACGGTTGAGAAACCATCGCTATTTCTAAAGAATCAGCGATTTTTAGTATGACTATGGAT 



CviJI 
NlalVl 



CviRI 



Bsbl 



Mmel 



BsmAl 
Ddel I 



TGCAGGAGCCTTCTATATCCAACACATTACAGAATGTAGTGGGTTCATAGGTTGTCTCTT 
2221 + + + + + + 2280 

ACGTCCTCGGAAGATATAGGTTGTGTAATGTCTTACATCACCCAAGTATCCAACAGAGAA 
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CviJl 
ScrFI | 
EcoRII | | Hinfl 
III I 



Plel 



c icai 

Alul 
CviJI 
Fnu4HI | 
Mnll Tsel| | 

II 



AGATAAACTTCCTGGCTCTTGGAGTCATAAACCCCTCGTTTTAGAAGGGCAGCTCGCTTA 

2281 + + + + + + 2340 

TCTATTTGAAGGACCGAGAACCTCAGTATTTGGGGAGCAAAATCTTCCCGTCGAGCGAAT 



Mae 1 1 
CviJI | 
Acelll | j 
BbvX| j j 
III I 



8su36I 
Ddel 
Hpyl78III 
Eco57I 
BseMII 
MboII 



Hpyl88IX 
Dpnl | 
Sau3AI | j 
I I I 



BstZ17l| 
Accl | j 
Bbsl | | | 

II 



Mnll 



BciVI 



I 



TAGCCACGTCAGTAATGATCTGAAGACAAAGTATACTGCGTATCCTGAGGTGAAAGGTTC 
2341 + + + --- + + + 2400 

ATCGGTGCAGTCATTACTAGACTTCTGTTTCATATGACGCATAGGACTCCACTTTCCAAG 



HphI 



MboII Alul 
Msel Nlalll I CviJI 



I 



Hpyl78III 



TTGGGGGAATAATGCTTTTAACATGATGTTGGGAGCTTCTTCTCATTCTTATCCTGAATA 
2401 + + + + + + 2460 

AACCCCCTTATTACGAAAATTGTACTACAACCCTCGAAGAAGAGTAAGAATAGGACTTAT 



Hpyl88IX 

BspMI Hinfl | Hpyl78III 

CviKI | BsaXI Tfil Maell I 

I I ! II II 

CCTGCATTGTTTTGATACCTATGCTCCATACATCyU^CTGAATCrGACCTATATACGTCA 
2461 + + + + + + 2520 

GGACGTAACAAAACTATGGATACGAGGTATGTAGTTTGACTTAGACTGGATATATGCAGT 



Hpyl88IX 
Alul | 
CviJI | 
Hin4I 



Rsal 



Dpnl 
Bglll | 
BstYI | 
Sau3AI j 



MboII 
MboII I 



Earl 
Tsp509l| 



GGACAGCTTCTCGGAGAAAGGTACAGAAGGAAGATCTTTTGATGAGAGCAACCTCTTCAA 
2521 + + + + + + 2580 

CCTGTCGAAGAGCCTCTTTCCATGTCTTCCTTCTAGAAAACTACTGTCGTTGGAGAAGTT 



Hpyl88IX 
Hin4I | 

Mnll Sfcl HphI || BplI 

I I I I I I 
TTTATCTTTGCCTATAGGGGTGAAGTTTGAGAAGTTCTCTGATTGTAATGACTTTTCTTA 
2581 + + + + + + 2640 

AAATAGAAACGGATATCCCCACTTCAAACTCTTCAAGAGACTAACATTACTGAAAAGAAT 
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Hpyl88IX 
Dpnl | 
Sau3AI | j 
III 



Dpnl 
Sau3AI | 
Hpyl78IIl| | 
II I 



BsrDI 
Dpnl 
Sau3AI ) 
Alwl | j 

Acil| | | 

II 



Sfcl 
CviRI | 

I I 



2641 



TGATCTGACTTTATCCTATGTTCCTGATCTTATCCGCAATGATCCCAAATGCACTACAGC 



ACTAGACTGAAATAGGATACAAGGACTAGAATAGGCGTTACTAGGGTTTACGTGATGTCG 



-+ 2700 



2701 



CviJI 

CviJI Hael 
NlaIV| HaelXI 
Acil | I StuI 
MspAlI | | Mnll Ddel Mwol I 

i n i i ii 

ACTTGTAATCAGCGGAGCCTCTTGGGAAACTTATGCCAATAACTTAGCACGACAGGCCTT 



TGAACATTAGTCGCCTCGGAGAACCCTTTGAATACGGTTATTGAATCGTGCTGTCCGGAA 



•+ 2760 



Bsrl 
CviJI 

Maelll Haelll 
Tsp45I Eael 
Cac8I | Gdill 
CviRI | j Bsgl BsiHKAI | 
CviRI Cac8I | j | Tthlllll | Bspl286I 
I I I I I II || 
GCAAGTG CGTGCAGGC AGTCACTACGCCTTCTCTCCTATGTTTGAAGTGCTCGGC CAGTT 
2761 + + + + + + 2820 

CGTTCACGCACGTCCGTCAGTGATGCGGAAGAGAGGATACAAACTTCACGAGCCGGTCAA 



2821 



Dpnl 
NlalV 

BamHI | Dpnl 

BstYI | Bglll | 

Sau3AI | BstYI j 

Alwl | j Alwl Mnll Sau3AI | 

I I I I I II | 
TGTCTTTGAAGTTCGTGGATCCTCACGGATTTATAATGTAGATCTTGGGGGTAAGTTCCA 



Hgal 
Tsp509I 



ACAGAAACTTCAAGCACCTAGGAGTGCCTAAATATTACATCTAGAACCCCCATTCAAGGT 



-+ 2880 
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Dpnl 
Sau3AI 
BseMII 
Ddel 
Hpyl88IX 
BplI 
Apol 
Tsp509I 
BsmAl 
Hpyl8 8IX| 
BseMII | 
Ddel | 
Nlalll 
BsmAI | 
Bfal BsmBI j 

BslI Hin4I | I I I I I I || | II Msel 

l III 

ATTCTAGGAGCGTCTCTCATGTCTCAGAAATTCTGAGAGAGATCGCATTTAGGATTTTCT 



2881 



-+ 2940 



TAAGATCCTCGCAGAGAGTACAGAGTCTTTAAGACTCTCTCTAGCGTAAATCCTAAAAGA 



TAAACACGAC 

2941 + 2950 

ATTTGTGCTG 
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Figure 5: Nucleotide and deduced amino acid sequence of CPN1 00638 



tgtagattct taacttacta gtctctcctt tcctcttgct ttctttraatt tattgcagta 60 - 

tgtggtgaaa taatttgtta aaccacctat agccctctac atg aaa tec tct ctt 115 

Met Lys Ser Ser Leu 
1 5 

cat tgg ttt tta ate teg tea tct tta gca ctt ccc ttg tea eta aat 163 
His Trp Phe Leu lie Ser Ser Ser Leu Ala Leu Pro Leu Ser Leu Asn 
10 15 20 

ttc tct gcg ttt get get gtt gtt gaa ate aat eta gga cct ace aat 211 
Phe Ser Ala Phe Ala Ala Val Val Glu lie Asn Leu Gly Pro Thr Asn 
25 30 35 

age ttc tct gga cca gga ace tac act cct cca gee caa aca aca aat 259 
Ser Phe Ser Gly Pro Gly Thr Tyr Thr Pro Pro Ala Gin Thr Thr Asn 
40 45 50 

gca gat gga act ate tat aat eta aca ggg gat gtc tea ate ace aat 3 07 
Ala Asp Gly Thr lie Tyr Asn Leu Thr Gly Asp Val Ser lie Thr Asn 
55 60 65 

gca gga tct ccg aca get eta ace get tec tgc ttt aaa gaa act act 355 
Ala Gly Ser Pro Thr Ala Leu Thr Ala Ser Cys Phe Lys Glu Thr Thr 
70 75 80 85 

ggg aat ctt tct ttc caa ggc cac ggc tac caa ttt etc eta caa aat 403 
Gly Asn Leu Ser Phe Gin Gly His Gly Tyr Gin Phe Leu Leu Gin Asn 
90 95 100 

ate gat gcg gga gcg aac tgt ace ttt ace aat aca get gca aat aag 451 
lie Asp Ala Gly Ala Asn Cys Thr Phe Thr Asn Thr Ala Ala Asn Lys 
105 110 115 

ctt etc tec ttt tea gga ttc tec tat ttg tea eta ata caa ace acg 499 
Leu Leu Ser Phe Ser Gly Phe Ser Tyr Leu Ser Leu lie Gin Thr Thr 
120 125 130 

aat get acc aca gga aca gga gee ate aag tec aca gga get tgt tct 547 
Asn Ala Thr Thr Gly Thr Gly Ala lie Lys Ser Thr Gly Ala Cys Ser 
135 140 145 

att cag teg aac tat agt tgc tac ttt ggc caa aac ttt tct aat gac 595 
lie Gin Ser Asn Tyr Ser Cys Tyr Phe Gly Gin Asn Phe Ser Asn Asp 
150 155 160 165 

aat gca ggc gee etc caa ggc age ten acc agu cca teg eta aac ccc 643 
Asn Gly Gly Ala Leu Gin Gly Ser Ser He Ser Leu Ser Leu Asn Pro 
170 175 180 

aac eta acg ttt gee aaa aac aaa gca acg caa aaa ggg ggt gec etc 691 
Asn Leu Thr Phe Ala Lys Asn Lys Ala Thr Gin Lys Gly Sly Ala Leu 
185 190 " . " iss 
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tat tec acg gga ggg f;t aca att aac aat acg tta aac tea gca tea 
Tyr Ser Thr Gly Gly lie Thr lie Asn Asn Thr Leu Asn Ser Ala Ser 
200 205 210 



739 



ttt tct gaa aat acc gcg gcg aac aat ggc gga gec att tac acg gaa 
Phe Ser Glu Asn Thr Ala Ala Asn Asn Gly Gly Ala lie Tyr Thr Glu 
215 220 225 



787 



get age agt ttt att age age aac aaa 
Ala Ser Ser Phe lie Ser Ser Asn Lys 
230 235 

a 9t 9^-9 acc gca acc tea get aca ggg 
Ser Val Thr Ala Thr Ser Ala Thr Gly 
250 

aca tea gee ccc aaa cca gtc tta act 
Thr Ser Ala Pro Lys Pro Val Leu Thr 
265 270 



gca att age ttt ata aac aat 835 
Ala lie Ser Phe lie Asn Asn 
240 245 

gga gec att tac tgt agt agt 883 
Gly Ala lie Tyr Cys Ser Ser 
255 260 

eta tea gac aac ggg gaa ctg 931 
Leu Ser Asp Asn Gly Glu Leu 
275 



aac ttt ata gga aat aca gca att act agt ggt ggg gcg att tat act 979 
Asn Phe lie Gly Asn Thr Ala lie Thr Ser Gly Gly Ala lie Tyr Thr 
280 285 290 

gac aat eta gtt ctt tct tct gga gga cct acg ctt ttt aaa aac aac 1027 
Asp Asn Leu Val Leu Ser Ser Gly Gly Pro Thr Leu Phe Lys Asn Asn 
295 300 305 



tct ggc tat gat act gca get ccc tta gga gga gca att gcg att get 1075 
Ser Gly Tyr Asp Thr Ala Ala Pro Leu Gly Gly Ala lie Ala lie Ala 
310 315 320 325 



gac tct gga tct ttg agt ctt teg get ctt ggt gga gac ate act ttt 1123 
Asp Ser Gly Ser Leu Ser Leu Ser Ala Leu Gly Gly Asp He Thr Phe 
330 335 340 



gaa gga aac aca gta gtc aaa gga get tct teg agt cag acc act acc 1171 
Glu Gly Asn Thr Val Val Lys Gly Ala Ser Ser Ser Gin Thr Thr Thr 
345 350 355 

aga aat tct att aac ate gga aac acc aat get aag att gta cag ctg 1219 
Arg Asn Ser He Asn He Gly Asn Thr Asn Ala Lys He Val Gin Leu 
360 365 370 

cga gec tct caa ggc aat act ate tac ttc tat gat cct ata aca act 1267 
Arg Ala Ser Gin Gly Asn Thr He Tyr Phe Tyr Asp Pro He Thr Thr 
375 380 " 385 

age ate acc gca get etc tea gat get eta aac tta aat ggt cct gac 1315 
Ser He Thr Ala Ala Leu Ser Asp Ala Leu Asn Leu Asn Gly Pro Asp 
390 395 400 " 405 

ctt gca ggg aat cct gca tat caa gga acc ate gta ttt tct gga gag j.3 63 
Leu Ala Gly Asn Pro Ala Tyr Gin Gly Thr He Val Phe Ser Gly Glu 
410 . 415 420 



SUBSTITUTE SHEET (RULE 26) 



raSW Title 



WO 00/32794 



Fig. 5 (con't) 



CHLAMYDIA ANTIGENS AND ^ 
CORRESPONDING DNA A 
FRAGMENTS AND USES THEREO^ 
Inventor(s): Andrew D. MURDIN et al. 
DOCKET NO.: 032931-0252 



09/8571 

PCT/CA99/01147 



aag etc teg gaa gca gaa get gca gaa get gat aat etc aaa tct aca 
Lys Leu Ser Glu Ala Glu Ala Ala Glu Ala Asp Asn Leu Lys Ser Thr 
425 430 435 



1411 



att cag caa cct eta act ctt gcg gga ggg caa etc tct ctt aaa tea 
lie Gin Gin Pro Leu Thr Leu Ala Gly Gly Gin Leu Ser Leu Lys Ser 
440 445 450 



1459 



gga gtc act eta gtt get aag tec ttt 
Gly Val Thr Leu Val Ala Lys Ser Phe 
455 460 

etc etc atg gat gca ggg ace aca tta 
Leu Leu Met Asp Ala Gly Thr Thr Leu 
470 475 

ate aat aat ctt gtt etc aat gta gat 
Xle Asn Asn Leu Val Leu Asn Val Asp 
490 

ggc acg eta aaa gca aca caa gca agt 
Gly Thr Leu Lys Ala Thr Gin Ala Ser 
505 510 



teg caa tct ccg ggc tct ace 1507 
Ser Gin Ser Pro Gly Ser Thr 
465 

gaa acc gec gat ggg ate act 1555 
Glu Thr Ala Asp Gly lie Thr 
480 485 

tec tta aaa gag acc aag aag 1603 
Ser Leu Lys Glu Thr Lys Lys 
495 500 

cag aca g:c act tta tct gga 1651 
Gin Thr Val Thr Leu Ser Gly 
515 



teg etc tct ctt gta gat cct tct gga aat gtc tac gaa gat gtc tct 1699 
Ser Leu Ser Leu Val Asp Pro Ser Gly Asn Val Tyr Glu Asp Val Ser 
520 525 ' 530 



tgg aat aac cct caa gtc ttt tct tgt etc act ctt act get gac gac 1747 
Trp Asn Asn Pro Gin Val Phe Ser Cys Leu Thr Leu Thr Ala Asp Asp 
535 540 545 

ccc gcg aat att cac ate aca gac tta get get gat ccc eta gaa aaa 1795 
Pro Ala Asn lie His lie Thr Asp Leu Ala Ala Asp Pro Leu Glu Lys 
550 555 560 565 



aat cct ate cat tgg gga tac caa ggg aat tgg gca tta tct tgg caa 1843 
Asn Pro lie His Trp Gly Tyr Gin Gly Asn Trp Ala Leu Ser Trp Gin 
570 575 580 

gag gat act gcg act aaa tec aaa gca gcg act ctt acc tgg aca aaa 1891 
Glu Asp Thr Ala Thr Lys Ser Lys Ala Ala Thr Leu Thr Trp Thr Lys 
585 590 595 

aca gga tac aat ccg aat cct gag cgt cgt gga acc tta gtt get aac 1939 
Thr Gly Tyr Asn Pro Asn Pro Glu Arg Arg Gly Thr Leu Val Ala Asn 
600 605 610 



acg eta tgg gga tec ttt gtt gat gtg cgc tec a:a caa cag ctt gta 1987 

Thr Leu Trp Gly Ser Phe Val Asp Val Arg Ser lie Gin Gin Leu Val 

615 620 625 

gee act aaa gta cgc caa tct caa gaa act cgc ggc ate tgg tqt gaa 2035 

Ala Thr Lys Val Arg Gin Ser Gin Glu Thr Arg Gly He Trp Cys Glu 
630 635 640 * 645 
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ggg ate teg aac ttc ttc cat aaa gat age acg aag ata aat aaa ggt 2 083 
Gly lie Ser Asn Phe Phe His Lys Asp Ser Thr Lys lie Asn Lys Gly 
650 655 660 

ttt cgc cac ata agt gca ggt tat gtt gta gga gcg act aca aca tta 213i 
Phe Arg His lie Ser Ala Gly Tyr Val Val Gly Ala Thr Thr Thr Leu 
665 670 675 

get tct gat aat ctt ate act gca gee ttc tgc caa tta ttc ggg aaa 2179 
Ala Ser Asp Asn Leu lie Thr Ala Ala Phe Cys Gin Leu Phe Gly Lys 
680 685 690 

gat aga gat cac ttt ata aat aaa aat aga get tct gec tat gca get 2227 
Asp Arg Asp His Phe He Asn Lys Asn Arg Ala Ser Ala Tyr Ala Ala 
695 700 705 

tct etc cat etc cag cat eta gcg acc ttg tct tct cca age ttg tta 2275 
Ser Leu His Leu Gin His Leu Ala Thr Leu Ser Ser Pro Ser Leu Leu 
710 715 720 725 

cgc tac ctt cct gga tct gaa agt gag cag cct gtc etc ttt gat get 2323 
Arg Tyr Leu Pro Gly Ser Glu Ser Glu Gin Pro Val Leu Phe Asp Ala 
730 735 740 

cag ate age tat ate tat agt aaa aat act atg aaa acc tat tac acc 2371 
Gin He Ser Tyr He Tyr Ser Lys Asn Thr Met Lys Thr Tyr Tyr Thr 
745 750 755 

caa gca cca aag gga gag age teg tgg tat aat gac ggt tgc get ctg 2419 
Gin Ala Pro Lys Gly Glu Ser Ser Trp Tyr Asn Asp Gly Cys Ala Leu 
760 765 770 

gaa ctt gcg age tec eta cca cac act get tta age cat gag ggt etc 24 67 
Glu Leu Ala Ser Ser Leu Pro His Thr Ala Leu Ser His Glu Gly Leu 
775 780 785 

ttc cac gcg tat ttt cct ttc ate aaa gta gaa get teg tac ata cac 2515 
Phe His Ala Tyr Phe Pro Phe He Lys Val Glu Ala Ser Tyr He His 
790 795 800 805 

caa gat age ttc aaa gaa cgt aat act acc ttg gta cga tct ttc gat 2563 
Gin Asp Ser Phe Lys Glu Arg Asn Thr Thr Leu Val Arg Ser Phe Asp 
810 815 820 

age ggt gat tta att aac gtc tct gtg cct att gga att acc ttc gag 2611 
Ser Gly Asp Leu lie Asn Val Ser Val Pro He Gly He Thr Phe Glu 
825 830 835 

aga ttc teg aga aac gag cgt gcg tct tac gaa get act gtc ate tac 2659 
Arg Phe Ser Arg Asn Glu Arg Ala Ser Tyr Glu Ala Thr Val lie Tyr 
840 845 850 

gtt gee gat gtc tat cgt aag aat cct gac cgc acg aca get etc eta 2707 
Val Ala Asp Val Tyr Arg Lys Asn Pro Asp Cys Thr Thr Ala Leu beu 
855 860 " S55 
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ate aac aat acc teg tgg aaa act aca gga acg aat etc tea aga caa 2 755 
lie Asn Asn Thr Ser Trp hys Thr Thr G? y Thr Asn Leu Ser Arg Gin 
870 875 880 885 

get ggt ate gga aga gca ggg ate ttt tat gee ttc tct cca aat ctt 2 80} 
Ala Gly He Gly Arg Ala Gly He Phe Tyr Ala Phe Ser Pro Asn Leu 
890 895 900 

gag gtc aca agt aac eta tct atg gaa att cgt gga tct tea cgc age 2851 
Glu Val Thr Ser Asn Leu Ser Met Glu He Arg Gly Ser Ser Arg Ser 
905 910 915 

tac aat gca gat ctt gga ggt aag ttc cag ttc taaaagcgtt cctgatccct 2904 
Tyr Asn Ala Asp Leu Gly Gly Lys Phe Gin Phe 
920 925 

tagaaattct aagagatcct gagtgtatct agggacttct caaaga 2950 
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Figure 6: Restriction enzyme analysis of CPN1 00638 



Tsp509I 
Msel | 
Mnll I 



Hinfl Bfal 
Tfil Msel Spel| BsmAI Mnll II CviRl 

i i ii i i ii i 

TGTAGATTCTTAACTTACTAGTCTCTCCTTTCCTCTTGCTTTCTTTAATTTATTGCAGTA 



ACATCTAAGAATTGAATGATCAGAGAGGAAAGGAGAACGAAAGAAATTAAATAACGTCAT 



-+ 60 



MseX 

Tsp509I HphI | 



61 



Mboll 

CviJI Mnll Mnll 

Sfcl i Nlalll) Earl | 

I I II II 
TGTGGTGAAATAATTTGTTAAACCACCTATAGCCCTCTACATGAAATCCTCTCTTCATTG 
+ h + + + + 

ACACCACTTTATTAAACAATTTGGTGGATATCGGGAGATGTACTTTAGGAGAGAAGTAAC 



120 



Bbvl 
Apol 
Tsp509I 



Maelll 
Tsp45I 



121 



Msel Tsp45I | 

I 

GTTTTTAATCTCGTCATCTTTAGCACTTCCCTTGTCACTAAATTTCTCTGCGTTTGCTGC 



Fnu4HI 
Mwol | 
Tselj 



-+ 180 



C^UW^TTAGAGCAGTAGAAATCGTGAAGGGAACAGTGATTTAAAGAGACGCAAACGACG 



NlalV 
BseRI 
Bpml 
ScrFI 
EcoRII 

Avail Avail 
EcoO109I BspGI 
PspSII Sau96I 
Sau96I Hpyl78IIl| | | | (BsaXI 

Sse8647I Alul || j | ||AloI| 

Bfal | CviJI II I I Ppill 

II I II I I II II 

TGTTGTTGAAATCAATCTAGGACCTACCAATAGCTTCTCTGGACCAGGAACCTACACTCC 

181 - + - + + -- + + + 240 

ACAACAACTTTAGTTAGATCCTGGATGGTTATCGAAGAGACCTGGTCCTTGGATGTGAGG 



Mnll 



CviJI 



I 



Bed 
TthlllII| 
CviRI I j 



HphI BsmAI 



TCCAGCCCAAACAACAAATGCAGATGGAACTATCTATAATCTAACAGGGGATGTCTCAAT 
241 - + + + + + + 300 

AGGTCGGGTTTGTTGTTTACGTCTACCTTGATAGATATTAGATTGTCCCCTACAGAGTTA 
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Alul 
CviJI 
Alwl 
Hpyl8 8IX 
Dpnl 



BstYI | 
Sau3AI j 
Fokl CviRI | j 

! Ill 



Acelll 
Acil | 
Mwol | | 
II I 



Dral 
Msel| 
Mmel | | 

II 



BsrI 
Hinf I 
Tfil 
Bmrl I 



I I 



301 



CACCAATGCAGGATCTCCGACAGCTCTAACCGCTTCCTGCTTTAAAGAAACTACTGGGAA 
+ + + + + + 

GTGGTTACGTCCTAGAGGCTGTCGAGATTGGCGAAGGACGAAATTTCTTTGATGACCCTT 



360 



BsaJI 
BstDSI 
CviJI | 
Hael j 
Haelllj 
BsaJI | j Tsp509I 

Sty I | | CviJI | 



361 



Clal 
Taql 
Faul 
Sthl32I | 
SfaNI | j 

I Beef I | I)) Acil 

I I i II 

TCTTTCTTTCCAAGGCCACGGCTACCAATTTCTCCTACAAAATATCGATGCGGGAGCGAA 



-+ 420 



AGAAAGAAAGGTTCCGGTGCCGATGGTTAAAGAGGATGTTTTATAGCTACGCCCTCGCTT 



Bbvl 
Rsal | 
Taal | j 



Alul 
CviJI 
Hindlll 
CviRI 
Fnu4HI 
Alul | 
CviJI | 
MspAlI j 
PvuII | 
Tsel 



Hinf I 
Tfil 
Hpyl7 8III | 



Maelll 
Tsp45I 



421 



CTGTACCTTTACCAATACAGCTGCAAATAAGCTTCTCTCCTTTTCAGGATTCTCCTATTT 



-+ 480 



GACATGGAAATGGTTATGTCGACGTTTATTCGAAGAGAGGAAAAGTCCTAAGAGGATAAA 



Bed 
CviJI | 
NlalV) j 



Alul 
CviJI 



BsmI 

i 

GTCACTAATACAAACCACGAATGCTACCACAGGAACAGGAGCCATCAAGTCCACAGGAGC 
481 + -- - + + + + + 5 40 

CAGTGATTATGTTTGGTGCTTACGATGGTGTCCTTGTCCTCGGTAGTTCAGGTGTCCTCG 
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Mill I 



CviJI 
Hael 
Haelll 
MscI 
Eael | 
Mwol | | 

Taql Sfcl CjePI | j j 

I I I II I | 
TTGTTCTATTCAGTCGAACTATAGTTGCTACTTTGGCCAAAACTTTTCTAATGACAATGG 
541 + + + + + + 600 

AACAAGATAAGTCAG CTTGATATCAACGATGAAACCGGTTTTGAAAAGATTACTGTTACC 

BsaJI 
Hael I 
Hhal 
Hin4I 
NlalV 
BsaHI 
Narl 
Banl| 
CjePI j 
II 



Alul 
CviJI 

Mnll| Acelll 
| Fnu4HI | j Bbvl | Acll 

|StyITsel||| BplI || CjePI Maell 
Mil I II I | 

AGGCGCCCTCCAAGGCAGCTCTATCAGTCTATCGCTAAACCCCAACCTAACGTTTGCCAA 

601 + + + + + + 660 

TCCGCGGGAGGTTCCGTCGAGATAGTCAGATAGCGATTTGGGGTTGGATTGCAAACGGTT 



Sthl32I 
Bspl286I 
Bmgl 
BseSI 
NlaIV| 
BanI | j 

I II 



BslI 
BscGI 
Mnll 
BsaJI | 
BstDSI j 
Mnll | I 



Msel 
Tsp509I | 
I I 



AAACAAAGCAACGCAAAAAGGGGGTGCCCTCTATTCCACGGGAGGGATTACAATTAACAA 

661 111 + + + + + + 720 

.TTTGTTTCGTTGCGTTTTTCCCCCACGGGAGATAAGGTGCCCTCCCTAATGTTAATTGTT 

Fnu4HI 
SacII 
Taul 
Acil| 
MspAlI j 

Thai | CviJI 
BseMII Acil | j NlalVj 

Hpyl88IX BsaJI | j Ecil | j 

SfaNl| BstDSI j) Acil | 

II I M II II 
TACGTTAAACTCAGCATCATTTTCTGAAAATACCGCGGCGAACAATGGCGGAGCCATTTA 
721 + + + + + + 780 

ATGCAATTTGAGTCGTAGTAAAAGACTTTTATGGCGCCGCTTGTTACCGCCTCGGTAAAT 



Msel 
Mae I I j Ddel 

I I I 
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Cac8I 
Bfal 
Alul| 
Cvi JI j 
Nhelj 
II 



Fnu4HI 
Tsel I 



Bbvl Alul 
Tsp5 09I CviJI 
I 



Maelll 
Tsp45I 



781 



CACGGAAGCTAGCAGTTTTATTAGCAGCAACAAAGCAATTAGCTTTATAAACAATAGTGT 
+ ~+ + + + + 

GTGCCTTCGATCGTCAAAATAATCGTCGTTGTTTCGTTAATCGAAATATTTGTTATCACA 



840 



CviJI 
NlalV 
BseMII 



Mnll 
Cjel 
Sfcl 
Alul 



CviJI 
Mwol 
BbvCI | 
BpulOI j 
Acil Ddelj 
I II 



Taal Rsal 
Sfcl | TatI | 
II 



Cjel 
CviJI | 

I I 



BsrI 
I 



841 



GACCGCAACCTC^GCTACAGGGGGAGCCATTTACTGTAGTAGTACATCAGCCCCCAAACC 



CTGGCGTTGGAGTCGATGTCCCCCTCGGTAAATGACATCATCATGTAGTCGGGGGTTTGG 



-+ 900 



Tsp509I Bfal 
Cjel | Spel| 



HpylSSIX 
Msel Sthl32I | BscGI 
I II I 

AGTCTTAACTCTATCAGAC^VACGGGGAACTGAACTTTATAGGAAATACAGCAATTACTAG 

- + "--- + + + + + 960 

TCAGAATTGAGATAGTCTGTTG C CCCTTGACTTGAAATATCCTTTATGTCGTTAATGATC 



961 



Mboll 
Cjel Bfal | 
I II 

TGGTGGGGCGATTTATACTGACAATCTAGTTCTTTCTTCTGGAGGACCTACGCTTTTTAA 



Avail 
EcoO109I 
PspSII 
Sau96I 
Sse8647I 
Hpyl78III | 
Mnll | j 



Bpml 
Dral | 
Msel | j 



ACCACCCCGCTAAATATGACTGTTAGATCAAGAAAGAAGACCTCCTGGATGCGAAAAATT 



1020 
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Muni 
TspSOSI 
Acelll 
Bsu36I 
Ddel 
Mnll 
Alul 
CviJI 
PstI | 
BsaXX | 
Fnu4HI j 
CviRI | j 
Tselj | | | |BbvI 
CviJI Sfcl | | j j * | |BslI 

I I III 

AAACAACTCTGGCTATGATACTGCAGCTCCCTTAGGAGGAGCAATTGCGATTGCTGACTC 
1021 + + + + + + 10 80 

TTTGTTGAGACCGATACTATGACGTCGAGGGAATCCTCCTCGTTAACGCTAACGACTGAG 



Haeiv 
BseRI Hin4I 
Plel |HinfI | 



Dpnl 
BstYI | 
Sau3AI | Alwl 
Hpyl78III| j Hinfl 



BsmAI 
CviJI | 
Plel | j 



Taal 



TGGATCTTTGAGTCIU'TCGGCTCTTGGTGGAGACATCACTTTTGAAGGAAACACAGTAGT 
1081 + + + + + + 1140 

ACCTAGAAACTCAGAAAGCCGAGAACCACCTCTGTAGTGAAAACTTCCTTTGTGTCATCA 



Alul 
CviJI 
MboII | 



Hpyl88IX 
Hinfl | 
Taql | | Plel 



Apol 
Tsp509I 



Hpyl88IX 
Msel 1 



CAAAGGAGCTTCTTCGAGTCAGACCACTACCAGAAATTCTATTAACATCGGAAACACCAA 
1141 + + + + + + 1200 

GTTTCCTCGAAGAAGCTCAGTCTGGTGATGGTCTTTAAGATAATTGTAGCCTTTGTGGTT 



CviJI 
Cac8I 
Fnu4HI 
Alul | 
CviJI j 
MspAlI | 
Pvullj 
Bce83I | j 
Rsal j | 
Bbvl BsrGI | j j 

Ddel Tatl | Tselj | |SmlI Mnll 
I I I II 

TGCTAAGATTGTACAGCTGCGAGCCTCTCAAGGCAATACTATCTACTTCTATGATCCTAT 
1201 + + --- + + + + 126O 

ACGATTCT7VACATGTCGACGCTCGGAGAGTTCCGTTATGATAGATGAAGATACTAGGATA 



Dpnl 
Sau3AI | 
Alwl | | 
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Bf al 



Acelll 
Bbvl 
HpylBBIX 
Alul 
CviJI 
PstI 



SfaNI 
TspRI 
Fnu4HI 
SfaNI 
CviRI) 
Tselj 
BtsI I j 
Sfcl I I 



Ddel j 
I I 



Hpyl78III 
Msel Avail | 
BseMII |Sau96I j 

I I 



CviRI 



AACAACTAGCATCACTGCAGCTCTCTCAGATGCTCTAAACTTAAATGGTCCTGACCTTGC 
1261 + + + + + + 13 20 

TTGTTGATCGTAGTGACGTCGAGAGAGTCTACGAGATTTGAATTTACCAGGACTGGAACG 



Hinfl 
Tfil 



CviRI 



Hpyl88IX 
Bbvl 
. Alul 

Bed CviJI 
Drdll I Hin4l| 
NlaIV| j Hpyl78III Hin4I I 
III I Ml 



Bpml 



1321 



AGGGAATCCTGCATATCAAGGAACCATCGTATTTTCTGGAGAGAAGCTCTCGGAAGCAGA 



-+ 1380 



TCCCTTAGGACGTATAGTTCCTTGGTAGCATAAAAGACCTCTCTTCGAGAGCCTTCGTCT 

Alul 
AlwNI 
CviJI 
Mwol 
PstI 
CviRI 
BplI I 
Fnu4HI I 
Sfcl I 
Alul) 
AlwNI j 
Bpll| 
BstAPI j 
CviJI j 
Mwol I 

Tselj I j I I I Tsp509I 
II 

AGCTGCAGAAGCTGATAATCTCAAATCTACAATTCAGCAACCTCTAACTCTTGCGGGAGG 
1381 + + + + + + 1440 

TCGACGTCTTCGACTATTAGAGTTTAGATGTTAAGTCGTTGGAGATTGAGAACGCCCTCC 



Acil 
Faul Mnll I 
Sthl32I I Mnll I j 
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Hin4I 
I 



Maelll 
Tsp45I 
Hinfl) 

Bpll| | 
Hpyl78III | | j 
Msel | | | | 

I I I I I 



Bfal 
Plel 



Ddel 
I 



Sthl32I 



CviJI 
BseRI 
Neil | 
ScrFI | 
Mspl I j 



GCAACTCTCTCTTAAATCAGGAGTCACTCTAGTTGCTAAGTCCTTTTCGCAATCTCCGGG 

1441 + + + + + + isoo 

CGTTGAGAGAGAATTTAGTCCTCAGTGAGATCAACGATTCAGGAAAAGCGTTAGAGGCCC 



Fokl 
Cjel 
NlalV 
Avail 



SfaNI 



Banll 
Bspl286I 



Sau96I 
CviRI 
Mnll 
Mnll | 
Nlalll| j 
BslI | | | 
I II I 



Alwl 
BsaBI 
Dpnl 

MspAlI Sau3AI | 
Acil | Bed | | 
BsmFI | |Bsll| j j 

Mi II I I 

CTCTACCCTCCTCATGGATGCAGGGACCACATTAGAAACCGCTGATGGGATCACTATCAA 

1501 + + + + + + 1560 

GAGATGGGAGGAGTACCTACGTCCCTGGTGTAATCTTTGGCGACTACCCTAGTGATAGTT 



Bsal 
Hinfl BsmAI 
Tfil Msel 



Bspl286I 
CacSI 
Bmgl | 
BseSI j 



Cjel Tfil Msel BseSI | Bsbl 

I II II I 

TAATCTTGTTCTCAATGTAGATTCCTTAAAAGAGACCAAGAAGGGCACGCTAAAAGCAAC 

1561 + + + + + + 1620 

ATTAGAACAAGAGTTACATCTAAGGAATTTTCTCTGGTTCTTCCCGTGCGATTTTCGTTG 



Tthlllll 
Maelll 
Taal 
Tsp45I 
Hpyl88IX | 
Mwol Cjel) j 



Hpyl78III 
Dpnl 
BstYI 



Dpnl 
Sau3AI | 
Hpyl78IIl| j 

II I 

ACAAGCAAGTCAGACAGTCACTTTATCTGGATGGCTCTCTCTTGTAGATCCTTCTGGAAA 

1621 + + + + + + 1680 

TGTTCGTTCAGTCTGTCAGTGAAATAGACCTAGCGAGAGAGAACATCTAGGAAGACCTTT 



Sau3AI 
Cjel | 
Alwl | | 
Alwl | | j 

I I 



Pflll08I MboII 
AccI | BsmAI | CjePI 

CjePl| | Bce83I || Smll |MnlI BsmAI 

II I I II III I 

TGTCTACGAAGATGTCTCTTGGAATAACCCTCAAGTCTTTTCTTGTCTCACTCTTACTGC 

1681 + + + + + -+ 1740 

ACAGATGCTTCTACAGAGAACCTTATTGGGAGTTCAGAAAAGAACAGAGTGAGAATGACG 
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Dpnl 
Sau3AI 
Fnu4HI 
Alul | 
Alwl j 
CviJI | 
Tsel | 
Ddel | j 



1741 



Faul 
Sspl 
Sthl32I| 
Thai | j 

SimI Acil | j j Bbvl Ddel | | | | Bfal 

I II II I 

TGACGACCCCGCGAATATTCACATCACAGACTTAGCTGCTGATCCCCTAGAAAAAAATCC 

+ + + - + + + xsoo 

ACTGCTGGGGCGCTTATAAGTGTAGTGTCTGAATCGACGACTAGGGGATCTTTTTTTAGG 



1801 



Tsp509I 

BsaJI | BciVI 

BciVI Styl j Mnll | 

I II II 
TATCCATTGGGGATACCAAGGGAATTGGGCATTATCTTGGCAAGAGGATACTGCGACTAA 
n + + + + + 

ATAGGTAACCCCTATGGTTCCCTTAACCCGTAATAGAACCGTTCTCCTATGACGCTGATT 



1860 



Hinfl BspGI 
Fnu4HI | ScrFI | 
Tsel | j Bbvl | j 
Plel|| j EcoRII j | BciVI 
III I III I 



BpulOI 
Ddel 
Hpyl78III 
Hinfl 
Tfil 
Hgal | 
Hpyl88IX j 
BseMII | j 

I I I 



ATCCAAAGCAGCGACTCTTACCTGGACAAAAACAGGATACAATCCGAATCCTGAGCGTCG 

1861 + + + + + + 1920 

TAGGTTTCGTCGCTGAGAATGGACCTGTTTTTGTCCTATGTTAGGCTTAGGACTCGCAGC 



Dpnl 
NlalV 
BamHI 

Cjel BstYI 
Ddel | Sau3AI 
NlalV | j Alwl | 

I II II 
TGGAACCTTAGTTGCTAACACGCTATGGGGATCCTTTGTTGATGTGCGCTCCATACAACA 
1921 + + + + + + 1980 

ACCTTGGAATCAACGATTGTGCGATACCCCTAGGAAACAACTACACGCGAGGTATGTTGT 



Hhal 
Hin4I | 
Alwl Cjel | j 



Alul 
CviJI 



Fnu4HI Dpnl 
Taul BstYI | 

Bce83I Hpyl78III Acil| Sau3AI ( 

CviJI | Rsal Smll | Thai | SfaNI | 

I III II II III 
GCTTGTAGCCACTAAAGTACGCCAATCTCAAGAAACTCGCGGCATCTGGTGTGAAGGGAT 
1981 + + + + + + 2040 

CGAACATCGGTGATTTCATGCGGTTAGAGTTCTTTGAGCGCCGTAGACCACACTTCCCTA 
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Xmnl 
Alwl 
Hpyl78III | 
Taqi | 
Mboll) j 
Ml 



MslI 
BspMI | CviRI 



Mboll 

I III 
CTCGAACTTCTTCCATAAAGATAGCACGAAGATAAATAAAGGTTTTCGCCACATAAGTGC 

2041 + + + "+* + + 2100 

GAGCTTGAAGAAGGTATTTCTATCGTGCTTCTATTTATTTCCAAAAGCGGTGTATTCACG 



Cvi JI 
Pstl| 
TspRI j 
Fnu4HI | 
CviRI | 

Hpyl88IX Tsel| 
Alul | Btsl | | 

Bsgl CviJI j sf cl I 

I II I II 
AGGTTATGTTGTAGGAGCGACTACAACATTAGCTTCTGATAATCTTATCACTGCAGCCTT 
2101 + + + + + + 2160 

TCCAATACAACATCCTCGCTGATGTTGTAATCGAAGACTATTAGAATAGTGACGTCGGAA 

Bbvl Dpnl 
Tsp509I Sau3AI | Alul 

Sthl32I |Hpyl78III BsaBI | j CviJI 

II I II I | 

CTGCCAATTATTCGGGAAAGATAGAGATCACTTTATAAATAAAAATAGAGCTTCTGCCTA 

2161 + + + + + + 2220 

GACGGTTAATAAGCCCTTTCTATCTCTAGTGAAATATTTATTTTTATCTCGAAGACGGAT 

Alul 
CviJI 
Fnu4HI 
Bpml | 
CviRI j 
Mwol j 
Tselj 

II 



Tthllll 
SfaNI 
Bbsl| 
Mboll | 
Bfal | j 

I III 
TGCAGCTTCTCTCCATCTCCAGCATCTAGCGACCTTGTCTTCTCCAAGCTTGTTACGCTA 
2221 + + + + + + 2280 

ACGTCGAAGAGAGGTAGAGGTCGTAGATCGCTGGAACAGAAGAGGTTCGAACAATGCGAT 



Bbvl 
Bed 



Mwol 
Maelll 
Alul | 
CviJI j 
Hindi I I | | 
I I 



Hpyl88IX 
Dpnl 
BstYI | 
Sau3AI j 
ScrFI | | 
EcoRII I 



Sfcl 
BseMII 
Alul 
CviJI 
Dpnl 



SfaNI Sau3AI 

CviJI | Hpyl8 8IX| 

Fnu4HI | j Ddel | | 

Alwl Tsel| | j Bbvl Mnll | || 

I III I I II II 

CCTTCCTGGATCTGAAAGTGAGCAGCCTGTCCTCTTTGATGCTCAGATCAGCTATATCTA 

2281 + + + + + + 2340 

GGAAGGACCTAGACTTTCACTCGTCGGACAGGAGAAACTACGAGTCTAGTCGATATAGAT 
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2341 



Banll 
BsiHKAI 
Bspl286I 
SacI 
BssSI [ 
Alul | 
CviJI | 
Tthlllll | j 
CjePI Taqll | | | 
t I I || 

TAGTAAAAATACTATGAAAACCTATTACACCCAAGCACCAAAGGGAGAGAGCTCGTGGTA 

+ + + + + + 

ATCATTTTTATGATACTTTTGGATAATGTGGGTTCGTGGTTTCCCTCTCTCGAGCACCAT 



2400 



Banll Kflalll 
BsiHKAI MboII 
Bspl286I CviJI 
SacI Mnll 
Alul | Msel | 

CviJI | TspRI | | 

Cac8I | j BtsI I 

I I I I Ml I II I 

TAATGACGGTTGCGCTCTGGAACTTGCGAGCTCCCTACCACACACTGCTTTAAGCCATGA 

2401 + + -- + + + + 2460 

ATTACTGCCAACGCGAGACCTTGAACGCTCGAGGGATGGTGTGTGACGAAATTCGGTACT 



Hpyl78III 
Hhal | 
CjePI | | 
Taal | | | 



SimI 



Thai 
Afllll 
Mlul 
Earl | 
Bsal | j 
BsmAI | j 



Alul 
CviJI 
Hindlll I 



Rsal 

I 



GGGTCTCITCCACGCGTATTTTCCTTTCATCAAAGTAGAAGCTTCGTACATACACCAAGA 



2461 



-+ 2520 



CCCAGAGAAGGTGCGCATAAAAGGAAAGTAGTTTCATCTTCGAAGCATGTATGTGGTTCT 



Mae 1 1 
Hphl| 
Msel 

Dpnl Pad 
Alul BsaJI Sau3AI | Tsp509I | 

CviJI Maell Styl Rsal | j TaqI Acil Msel | j 

I I i II I I I Ml 

TAGCTTCAAAGAACGTAATACTACCTTGGTACGATCTTTCGATAGCGGTGATTTAATTAA 

2521 + + + + + + 2580 

ATCGAAGTTTCTTGC^TTATGATGGAACCATGCTAGAAAGCTATCGCCACTAAATTAATT 
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BsmAI 
BsmBI 



Tsp509I 



Hgal 
Hpyl78III 
Hpyl78III 
Taql 
Aval | 
Smllj 
Hin4I | | 
Hinf I j | 
Hpyl7 8III |XhoI| 
Taql |TfiI | j 

I I I 



BplI 

Cac8I | 
Hin4l j 

II 



2581 



CGTCTCTGTGCCTATTGGAATTACCTTCGAGAGATTCTCGAGAAACGAGCGTGCGTCTTA 



■+ 2640 



GCAGAGACACGGATAACCTTAATGGAAGCTCTCTAAGAGCTCTTTGCTCGCACGCAGAAT 



2641 



Hpyl78III 

Alul Bsgl Hinfl | 

CviJI Taal Maell BsaBI | Tfil j CviRl 

ji i I I i i i i 

CGAAGCTACTGTCATCTACGTTGCCGATGTCTATCGTAAGAATCCTGACTGCACGACAGC 



Alul 
CviJI 



GCTTCGATGACAGTAGATGCAACGGCTACAGATAGCATTCTTAGGACTGACGTGCTGTCG 



-+ 2700 



Bce83I Hpyl78III 
Sfcl| Hinfl | Alul 

Acelll BssSI Mnll | | Tfil Smll CviJI 

I I I II III | 
TCTCCTAATCAACAATACCTCGTGGAAAACTACAGGAACGAATCTCTCAAGACAAGCTGG 
2701 + + + + + + 2760 

AGAGGATTAGTTGTTATGGAGCACCTTTTGATGTCCTTGCTTAGAGAGTTCTGTTCGACC 



Hpyl88IX 
Earl | 
Sapl j 



MboII 
Dpnl 

Bstyi | 

Sau3AI j 
Hin4I | j 
I I I 



Alwl 



Maelll 
Tsp45I 
Hpyl78III | 
Smll | | 
Mnll |j j Maelll 
I I 



TATCGGAAGAGCAGGGATCTTTTATGCCTTCTCTCCAAATCTTGAGGTCACAAGTAACCT 
2761 + + + + + + 2820 

ATAGCCTTCTCGTCCCTAGAAAATACGGAAGAGAGGTTTAGAACTCCAGTGTTCATTGGA 



Dpnl 
BstYI 
Sau3AI 
MboII 
Apol 



Tsp509I 
BslI | 
Bce83I | j 

I 



Alul 
CviJI 
Fnu4HI | 
Tsel| j 
Alwl | j j 



Dpnl 
Mnll 
Bglll| 
BstYI | 
Sau3AI | 
Bbvl | j 
CviRI | | j 
II 



Bsrl 



ATCTATGGAAATTCGTGGATCTTCACGCAGCTACAATGCAGATCTTGGAGGTAAGTTCCA 
2821 + + + + + + 2880 

TAGATACCTTTAAGCACCTAGAAGTGCGTCGATGTTACGTCTAGAACCTCCATTCAAGGT 
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Ddel 
Hpyl78III 
Dpnl 
BstYI 
Sau3AI 
Dpnl BseMII 
Sau3AI | Apol | 

Hpyl78IIl| | Tsp509I Alwl j 
Alwl || | Ddel J Ddel] | | J Bfal 

1 II I I I II 

GTTCTAAAAGCGTTCCTGATC C CTTAGAAATTCTAAGAGATCCTGAGTGTATCTAGGGAC 

2881 + + + --- + + + 2940 

CAAGATTTTCGCAAGGACTAGGGAATCTTTAAGATTCTCTAGGACTCACATAGATCCCTG 



BsmFI 



TTCTCAAAGA 

2941 + 2950 

AAGAGTTTCT 
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Figure 7: Nucleotide and amino acid sequence of CPN100639 

gtacgaagtt cttcacgaaa ttataataca aacctaggct ctaagttttg tttctagatt 60 

atcgaaaacg tgttaattaa ttgaacccaa gcatctttct atg aaa ata ccc ttg 115 

Met Lys lie Pro Leu 
1 5 

cac aaa etc ctg ate tct teg act ctt gtc act ccc att eta ttg age 163 
His Lys Leu Leu lie Ser Ser Thr Leu Val Thr Pro lie Leu Leu Ser 
10 15 20 

att gca act tac gga gca gat get tct tta tec cct aca gat age ttt 211 
lie Ala Thr Tyr Gly Ala Asp Ala Ser Leu Ser Pro Thr Asp Ser Phe 
25 30 35 

gat gga gcg ggc ggc tct aca ttt act cca aaa tct aca gca gat gec 259 
Asp Gly Ala Gly Gly Ser Thr Phe Thr Pro Lys Ser Thr Ala Asp Ala 
40 45 50 

aat gga acg aac tat gtc tta tea gga aat gtc tat ata aac gat get 3 07 
Asn Gly Thr Asn Tyr Val Leu Ser Gly Asn Val Tyr lie Asn Asp Ala 
55 60 65 

ggg aaa ggc aca gca tta aca ggc tgc tgc ttt aca gaa act acg ggt 3 55 
Gly Lys Gly Thr Ala Leu Thr Gly Cys Cys Phe Thr Glu Thr Thr Gly 
70 75 80 85 

gat ctg aca ttt act gga aag gga tac tea ttt tea ttc aac acg gta 403 
Asp Leu Thr Phe Thr Gly Lys Gly Tyr Ser Phe Ser Phe Asn Thr Val 
90 95 100 

gat gcg ggt teg aat gca gga get gcg gca age aca act get gat aaa 451 
Asp Ala Gly Ser Asn Ala Gly Ala Ala Ala Ser Thr Thr Ala Asp Lys 
105 110 115 

gee eta ate ttc aca gga ttt tct aac ctt tec ttc att gca get cct 499 
Ala Leu lie Phe Thr Gly Phe Ser Asn Leu Ser Phe lie Ala Ala Pro 
120 125 130 

gga act aca gtt get tea gga aaa agt act tta agt tct gca gga gec 547 
Gly Thr Thr . Val Ala Ser Gly Lys Ser Thr Leu Ser Ser Ala Gly Ala 
135 140 145 

tta aat ctt ace gat aat gga acg att etc ttt age caa aac gtc tec 595 
Leu Asn Leu Thr Asp Asn Gly Thr lie Leu Phe Ser Gin Asn Val Ser 
150 155 160 165 

aat gaa get aat aac aat ggc gga gcg ate acc aca aaa act ctt tct 643 
Asn Glu Ala Asn Asn Asn Gly Gly Ala lie Thr Thr Lys Thr Leu Ser 
170 175 * 180 

att tct ggg aat acc tct tct ata acc ttc act agt aat age yea daa o91 
lie Ser Gly Asn Thr Ser Ser lie Thr Phe Thr Ser Asn Ser Ala Lys 
185 190 195 
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aaa tta ggt gga gcg ate tat age tct gcg g-:t gca agt act tea gga 739 
Lys Leu Gly Gly Ala lie Tyr Ser Ser Ala Ala Ala Ser lie Ser Gly 
200 " 205 210 

aac acc ggc cag tta gtc ttt atg aat aat aaa gga gaa act ggg ggt 787 * 
Asn Thr Gly Gin Leu Val Phe Met Asn Asn Lys Gly Glu Thr Gly Gly 
215 :220 225 

9gg get ctg ggc ttt gaa gec age tec teg att act caa aat age tec 835 
Gly Ala Leu Gly Phe Glu Ala Ser Ser Ser lie Thr Gin Asn Ser Ser 
230 235 240 245 

ctt ttc ttc tct gga aac act gca aca gat get gca ggc aag ggc ggg 883 
Leu Phe Phe Ser Gly Asn Thr Ala Thr Asp Ala Ala Gly Lys Gly Gly 
250 255 260 

gee att tat tgt gaa aaa aca gga gag act cct act ctt act ate tct 931 
Ala He Tyr Cys Glu Lys Thr Gly Glu Thr Pro Thr Leu Thr He Ser 
265 270 275 

gga aat aaa agt ctg acc ttc gec gag aac tct tea gta act caa ggc 979 
Gly Asn Lys Ser Leu Thr Phe Ala Glu Asn Ser Ser Val Thr Gin Gly 
280 285 290 

gga gca ate tgt gec cat ggt eta gat ctt tec get get ggc cct acc 1027 
Gly Ala He Cys Ala His Gly Leu Asp Leu Ser Ala Ala Gly Pro Thr 
295 300 305 

eta ttt tea aat aat aga tgc ggg aac aca get gca ggc aag ggc ggc 1075 
Leu Phe Ser Asn Asn Arg Cys Gly Asn Thr Ala Ala Gly Lys Gly Gly 
310 315 320 325 

get att gca att gee gac tct gga tct tta agt etc tct gca aat caa 1123 
Ala He Ala He Ala Asp Ser Gly Ser Leu Ser Leu Ser Ala Asn Gin 
330 335 340 

gga gac ate acg ttc ctt ggc aac act eta acc tea acc tec gcg cca 1171 
Gly Asp lie Thr Phe Leu Gly Asn Thr Leu Thr Ser Thr Ser Ala Pro 
345 350 355 

aca teg aca egg aat get ate tac ctg gga teg tea gca aaa att acg 1219 
Thr Ser Thr Arg Asn Ala He Tyr Leu Gly Ser Ser Ala Lvs He Thr 
360 365 " 370 

aac tta agg gca gec caa ggc caa tct ate tat ttc tat gat ccg att 1267 
Asn Leu Arg Ala Ala Gin Gly Gin Ser He Tyr Phe Tyr Asd Pro He 
375 380 385 

gca tct aac acc aca gga get tea gac gtt ctg acc ate aac caa ccg 1315 
Ala Ser Asn Thr Thr Gly Ala Ser Asp Val Leu Thr He Asr. Gin Pro 
390 395 400 405 

gat age aac teg cct tta gat tat tea gga acg att gta ttt tct ggg 1363 
Asp Ser Asn Ser Pro Leu Asp Tyr Ser Gly Thr He Val Phe Ser Gly 
410 415 420 
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gaa aag etc tct gca gat gaa gcg aaa get get gat aac ttc aca tct 1411 
Glu Lys Leu Ser Ala Asp Glu Ala Lys Ala Ala Asp Asn Phe Thr Ser 
425 430 435 

ata tta aag caa cca ttg get eta gec tct gga ace tta gca etc aaa 1459 
He Leu Lys Gin Pro Leu Ala Leu Ala Ser Gly Thr Leu Ala Leu Lys 
440 445 450 

gga aat gtc gag tta gat gtc aat ggt ttc aca cag act gaa ggc tct 1507 
Gly Asn Val Glu Leu Asp Val Asn Gly Phe Thr Gin Thr Glu Gly Ser 
455 460 465 

aca etc etc atg caa cca gga aca aag etc aaa gca gat act gaa get 1555 
Thr Leu Leu Met Gin Pro Gly Thr Lys Leu Lys Ala Asp Thr Glu Ala 
470 475 480 485 

ate agt ctt acc aaa ctt gtc gtt gat ctt tct gee tta gag gga aat 1603 
He Ser Leu Thr Lys Leu Val Val Asp Leu Ser Ala Leu Glu Gly Asn 
490 495 500 

aag agt gtg tec att gaa aca gca gga gec aac aaa act ata act eta 1651 
Lys Ser Val Ser He Glu Thr Ala Gly Ala Asn Lys Thr He Thr Leu 
505 510 515 

acc tct cct ctt gtt ttc caa gat agt age ggc aat ttt tat gaa age 1699 
Thr Ser Pro Leu Val Phe Gin Asp Ser Ser Gly Asn Phe Tyr Glu Ser 
520 525 530 

cat acg ata aac caa gee ttc acg cag cct ttg gtg gta ttc act get 1747 
His Thr He Asn Gin Ala Phe Thr Gin Pro Leu Val Val Phe Thr Ala 
535 540 545 

get act get get age gat att tat ate gat gcg ctt etc act tct cca 1795 
Ala Thr Ala Ala Ser Asp He Tyr He Asp Ala Leu Leu Thr Ser Pro 
550 555 560 565 

gta caa act cca gaa cct cat tac ggg tat cag gga cat tgg gaa gec 1843 
Val Gin Thr Pro Glu Pro His Tyr Gly Tyr Gin Gly His Trp Glu Ala 
570 575 580 

act tgg gca gac aca tea act gca aaa tea gga act atg act tgg gta 1891 
Thr Trp Ala Asp Thr Ser Thr Ala Lys Ser Gly Thr Met Thr Trp Val 
585 590 595 

act acg ggc tac aac cct aat cct gag cgt aga get tec gta gtt ccc 1939 
Thr Thr Gly Tyr Asn Pro Asn Pro Glu Arg Arg Ala Ser Val Val Pro 
600 605 610 

gat tea tta tgg gca tec ttt act gac att cgc act eta cag cag ate 1987 
Asp Ser Leu Trp Ala Ser Phe Thr Asp He Arc Thr Leu Gin Gin He 
615 62C £25 

atg aca tct caa gcg aat agt ate tat cag caa cga gga etc tog gca 2035 
Met Thr Ser Gin Ala Asn Ser He Tyr Gin Gin Arg Gly Leu Trp Ala 
630 635 640 " " 645 
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tea gga act gcg aat ttc ttc cat aag gat aaa tea gga act aac caa 2083 
Ser Gly Thr Ala Asn Phe Phe His Lys Asp Lys Ser Gly Thr Asn Gin 
650 655 * 660 

gca ttc cga cat aaa age tac ggc tat att gtt gga gga agt get gaa 213i 
Ala Phe Arg His Lys Ser Tyr Gly Tyr lie Val Gly Gly Ser Ala Glu 
665 670 675 

gat ttt tct gaa aat ate ttc agt gta get ttc tgc cag etc ttc ggt 2179 
Asp Phe Ser Glu Asn He Phe Ser Val Ala Phe Cys Gin Leu Phe Gly 
680 685 690 

aaa gat aaa gac ctg ttt ata gtt gaa aat ace tct cat aac tat tta 2227 
Lys Asp Lys Asp Leu Phe He Val Glu Asn Thr Ser His Asn Tyx Leu 
695 700 705 

gcg teg eta tac ctg caa cat cga gca ttc eta gga gga ctt ccc atg 2275 
Ala Ser Leu Tyr Leu Gin His Arg Ala Phe Leu Gly Gly Leu Pro Met 
? 1° 715 720 725 

ccc tea ttt gga agt ate acc gac atg ctg aaa gat att cct etc att 2323 
Pro Ser Phe Gly Ser lie Thr Asp Met Leu Lys Asp He Pro Leu He 
730 735 740 

ttg aat gee cag eta age tac age tac act aaa aat gat atg gat act 2371 
Leu Asn Ala Gin Leu Ser Tyr Ser Tyr Thr Lys Asn Asp Met Asp Thr 
745 750 755 

cgc tat act tec tat cct gaa get caa ggc tct tgg acc aat aac tct 2419 
Arg Tyr Thr Ser Tyr Pro Glu Ala Gin Gly Ser Trp Thr Asn Asn Ser 
760 765 770 

ggg get eta gag etc gga gga tct ctg get eta tat etc. cct aaa gaa 2467 
Gly Ala Leu Glu Leu Gly Gly Ser Leu Ala Leu Tyr Leu Pro Lys Glu 
7 *75 780 785 

gca ccg ttc ttc cag gga tat ttc ccc ttc tta aag ttc cag gca gtc 2515 
Ala Pro Phe Phe Gin Gly Tyr Phe Pro Phe Leu Lys Phe Gin Ala Val 
790 795 800 805 

tac age cgc caa caa aac ttt aaa gag agt ggc get gaa gee cgt get 2563 
Tyr Ser Arg Gin Gin Asn Phe Lys Glu Ser Gly Ala Glu Ala Arg Ala 
810 815 820 

ttt gat gat gga gac eta gtg aac tgc tct ate cct gtc ggc att egg 2611 
Phe Asp Asp Gly Asp Leu Val Asn Cys Ser He Pro Val Glv He Arg 
825 830 835 

tta gaa aaa ate tec gaa gat gaa aaa aat aat tec gag att tct eta 2659 
Leu Glu Lys He Ser Glu Asp Glu Lys Asn Asn Phe Glu He Ser L-u 
840 345 850 

gec tac att ggt gat gtg tat cgt aaa aat ccc cgt teg cgt act tct 2707 
Ala Tyr He Gly Asp Val Tyr Arg Lys Asn Pro Arg Ser Ara Thr Ser 
255 860 S65 
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eta atg gtc agt gga gec tct tgg act teg eta tgt aaa aac etc gca 2755 
Leu Met Val Ser Gly Ala Ser Trp Thr Ser Leu Cys Lys Asn Leu Ala 
870 875 880 885 

cga caa gee ttc tta gca agt get gga age cat ctg act etc tec cct 2803 
Arg Gin Ala Phe Leu Ala Ser Ala Gly Ser His Leu Thr Leu Ser Pro 
890 895 900 

cat gta gaa etc tct ggg gaa get get tat gag ctt cgt ggc tea gca 2 851 
His Val Glu Leu Ser Gly Glu Ala Ala Tyr Glu Leu Arg Gly Ser Ala 
905 910 915 

cac ate tac aat gta gat tgt ggg eta aga tac tea ttc tagttcctac 2900 
His lie Tyr Asn Val Asp Cys Gly Leu Arg Tyr Ser Phe 
920 925 930 



tttcctccct aaacttttag ggaggaattc ttataaaaac cctgtagatt cttaacttac 2960 
tagtctctcc tttcctcttg ctttctttaa tttattgcag 3000 
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Figure 8: Restriction enzyme analysis of CPN 100639 



Tsp509I 

MboII Hpyl78III | 
Rsal XmnI | j 

I III 



Ddel 
CviJI 
Bfal 
Avrll | 
BsaJI | 
Styl| 
II 



Hpyl78III 
Bfal | 
Xbal | j 

II 



GTACGAAGTTCTTCACGAAATTATAATACAAACCTAGGCTCTAAGTTTTGTTTCTAGATT 



CATGCTTCAAGAAGTGCTTTAATATTATGTTTGGATCCGAGATTCZAAAACAAAGATCTAA 

Tsp509I 
Msel| 
Pad j 

Vspl I 
Tsp509I | j 
Af 1III | | | Tthlllll 

Taql Maell Msel j || SfaNI I CviRi 

I I II II II | 

ATCGAAAACGTGTTAATTAATTGAACCCAAGCATCTTTCTATGAAAATACCCTTGCACZAA 



- + 60 



61 



•+ 120 



TAGCTTTTGCACAATTAATTAACTTGGGTTCGTAGAAAGATACTTTTATGGGAACGTGTT 



Plel 
Dpnl 
Sau3AI | 
Hpyl78IIl| j 
MboII I I I 



I II I 



Maelll 
Tsp45I 
PshAI 
Earl | 
Hinfl j 
Taql | | 
I I I 



CviRI 
BsrDI | SfaNI 
I I 



121 



ACTCCTGATCTCTTCGACTCTTGTCACTCCCATTCTATTGAGCATTGCAACTTACGGAGC 



-+ 180 



TGAGGACTAGAGAAGCTGAGAACAGTGAGGGTAAGATAACTCGTAACGTTGAATGCCTCG 



CviJI 
Fnu4HI 
Taul 
Acil 
Cac8I 
Acil 



Sfcl 



BsrBI 
Bed 
Faul 
Sthl32I | 
Alul | | 
CviJI 



I II 



BsaXI 



181 



AGATGCTTCTTTATCCCCTACAGATAGCTTTGATGGAGCGGGCGGCTCTACATTTACTCC 



-+ 240 



TCTACGAAGAAATAGGGGATGTCTATCGAAACTACCTCGCCCGCCGAGATGTAAATGAGG 
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Sf cl 
SfaNI | 
I I 



Hpyl78III 
I 



SfaNI 



AAAATCTACAGCAGATGCCAATGGAACGAACTATGTCTTATCAGGAAATGTCTATATAAA 

241 + + - + + + + 300 

TTTTAGATGTCGTCTACGGTTACCTTGCTTGATACAGAATAGTCCTTTACAGATATATTT 



Hpyl88IX 
Dpnl 

Fnu4HI Sau3AI 
Tsel| BscGI 
Msel Fnu4HI | j Bsp24I 
Bbvl | CyiJI | | j cjel 
Bbvl | j Tselj || Sthl32I CjePI 

III II II I I 

CGATGCTGGGAAAGGCACAGCATTAACAGGCTGCTGCTTTACAGAAACTACGGGTGATCT 

301 + + + + + + 360 

GCTACGACCCTTTCCGTGTCGTAATTGTCCGACGACGAAATGTCTTTGATGCCCACTAGA 



361 



Faul NspV 

Taal TaqI 

Cjel Sthl32l| Bbvl | 

BsrI CjePI | Bsbl || Acelll | j 

HphI BciVI | Bsp24l| j SfaNI | j j Acil | j | CviRI 

III Ml I I II I III I 

GACATTTACTGGAAAGGGATACTCATTTTCATTCAACACGGTAGATGCGGGTTCGAATGC 

f- + H + + 

CTGTAAATGACCTTTCCCTATGAGTAAAAGTAAGTTGTGCCATCTACGCCCAAGCTTACG 



420 



Cac8I 
Fnu4HI 
Taul 
Acil 



Fnu4HI 

Alul 
AlwNI 
BstAPI 
CviJI 
Mwol 
Tsel 
BsmI | 



CviJI 
MboII | 
Mwol | | 
Tttilllll| j | 

II II 

AGGAGCTGCGGCAAGCACAACTG CTGATAAAG CCCTAATCTTCACAGGATTTTCTAACCT 



421 



480 



TCCTCGACGCCGTTCGTGTTGACGACTATTTCGGGATTAGAAGTGTCCTAAAAGATTGGA 
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Eco57I 
ScrFI 

AlwMI 
EcoRII 
Alul 



CviJI 
Fnu4HI | 
CviRI | I 
BsrDI Tselj j 
I II I 



Acelll 

Sfcl Hpyl78III 
BbvIlTaal I 



Rsal 
Seal 
TatI |MseI 
I I 



CviRI 
Sfcl | 

I I 



481 



TTCCTTCATTGCAGCTCCTGGAACTACAGTTGCTTCAGGAAAAAGTACTTTAAGTTCTGC 



AAGGAAGTAACGTCGAGGACCTTGATGTCAACGAAGTCCTTTTTCATGAAATTCAAGACG 



•+ 540 



Msel 
Cjel 
CviJI | 
NlaIV| j 
PstI | | | 



CjePI 

Hinfl | CviJI BsmAI 

Tfil j Cjel | Maell BsmBI 
I I 



541 



AGGAGCCTTAAATCTTACCGATAATGGAACGATTCTCTTTAGCCAAAACGTCTCCAATGA 



TCCTCGGAATTTAGAATGGCTATTACCTTGCTAAGAGAAATCGGTTTTGCAGAGGTTACT 

Dpnl 
Sau3AI 
CjePI 
Ecil I 

HphI | | | CjePI 
Acil| j | | Mbolll 



- + 600 



Alul 
CviJI CjePI 



601 



I 

AGCTAATAACAATGGCGGAGCGATCACCACAAAAACTCTTTCTATTTCTGGGAATACCTC 
+ +. + + _ ( + 

TCGATTATTGTTACCGCCTCGCTAGTGGTGTTTTTGAGAAAGATAAAGACCCTTATGGAG 



660 



Bbvl 

Sfcl Fnu4HI 

Mnll Bfal Tsp509I Dpnl | Alul Taul 

Earl | Spel| Hhal | Sau3AI | CviJI Acil I 

II II II Mill' 

TTCTATAACCTTCACTAGTAATAGCGCAAAAAAATTAGGTGGAGCGATCTATAGCTCTGC 
661 + + + + + + 720 

AAGATATTGGAAGTGATCATTATCGCGTTTTTTTAATCCACCTCGCTAGATATCGAGACG 



CviRI 
Fnu4HI | 
CviJI | j 
Tselj | 



Bsrl 
CviJI 
Haeill 
Eael 
Gdill 
Mspl| 

Hpyl78III BsrFI | j 



III 



Cjel 



721 



GGCTGCAAGTATTTCAGGAAACACCGGCCAGTTAGTCTTTATGAATAATAAAGGAGAAAC 



-+ 780 



CCGACGTTCATAAAGTCCTTTGTGGCCGGTCAATCAGAAATACTTATTATTTCCTCTTTG 
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J 



Banll 
Bsrl Bspl286I 



Alul 
CviJI 
Cac8X 
CviJI 
Cjel | 
BseRll 



Bmrl 
I 



CviJI | CviJI 



I Ml 



. Mill I 
Acelll | 
Taql | | 

I I I 



MboII 
Alul | 
CviJI | 

II 



TGGGGGTGGGGCTCTGGGCTTTGAAGCCAGCTCCTCGATTAOTCAAAATAGCTCCCTTTT 
781 + + + + + „ _ + 840 

ACCCCCACCCCGAGACCCGAAACTTCGGTCGAGGAGCTAATGAGTTTTATCGAGGGAAAA 



Faul 
CacSI 
PstI 
Sthl32I 
CviRI 
Fnu4HI 
Sfcl 
AlwNI 



Tsel 
BstAPI 
TspRI | 
CviRI | | 
Btsl | | | 
Bbvl Ml | 
Hpyl78III SfaNI | | |MwoI 



CviJI 
Haelll 
NlalVj 
Sau96l| j 
Acil | | | 
Cjel | HI 



CTTCTCTGGAAACACTGCAACAG ATG CTGCAGGCAAGGG CGGGGC CATTTATTGTGAAAA 

841 + + + + + + 900 

GAAGAGACCTTTGTGACGTTGTCTACGACGTCCGTTCCCGCCCCGGTAAATAACACTTTT 



Hin4I 
Cjel | 
Plel Cjel || 
BsmAI | Hinf I | j Hpyl7 8III 

M I II I I II || 
AACAGGAGAGACTCCTACTCTTACTATCTCTGGAAATAAAAGTCTGACCTTCGCCGAGAA 
901 + + + + + + 960 

TTGTCCTCTCTGAGGATGAGAATGATAGAGACCTTTATTTTCAGACTGGAAGCGGCTCTT 



ECOS7I 
Cjel | Bce83I 
Hpyl88IX | j MboII 
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Dpnl 
Bglll 
BstYI 
Sau3AI 
Hpyl78IIl| 
Bfall I 



Bbvl 
Xbal 
Nlalll 
BsaJI 



Earl 
Maelll 



Smll 



Ecil 
Acil I 



Bspl286I 
BstDSI 
Ncol 
Styl 
Bmgl 
BseSI 
BstAPI | 
Mwol | 
I I 



II 
II 



II 
II 
II 



CviJI 
Haelll 
SaU96I 
Cac8I 
Fnu4HI 
MspAlI | 
Tsel I 
Acil | I 



I II 



961 



CTCTTCAGTAACTCAAGGCGGAGCAATCTGTGCCCATGGTCTAGATCTTTCCGCTGCTGG 

+ + + + + + l02o 

GAGAAGTCATTGAGTTCCGCCTCGTTAGACACGGGTACCAGATCTAGAAAGGCGACGACC 



Faul 
Sthl32l| Acil 
Sf aNI | j Bbvl | 



CacSI 
PstI 
CviRl 
Fnu4HI 
Sfcl 
Alul| 
CviJI | 
MspAllj 
PvuII | 
Tsel 



Fnu4HI 
Taul 
Acil (Haell 
Mwol|Hhal| 



1021 



CCCTACCCTATTTTCAAATAATAGATGCGGGAACACAGCTGCAGGCAAGGGCGGCGCTAT 



GGGATGGGATAAAAGTTTATTATCTACGCCCTTGTGTCGACGTCCGTTCCCGCCGCGATA 



-+ 1080 



Plel 
Muni | 
Tsp509I j 
CviRI I 



Dpnl 
BstYI 
Sau3AI 
Hpyl78IIl| 
HaeIV| | 
Hin4I J j 
Hinf I | j | 



1081 



BsmAI 

Alwl CviRI | BsaJI 

Msel | BsmAI | |CjeI Maell Styl 

M I I I I || 

TGCAATTGCCGACTCTGGATCTTTAAGTCTCTCTGCAAATCAAGGAGACATCACGTTCCT 



ACGTTAACGGCTGAGACCTAGAAATTCAGAGAGACGTTTAGTTCCTCTGTAGTGCAAGGA 



-+ 1140 
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Mnll 
Hhal 
Thai ( 
Mnll | [ 
Acilll I 



1141 



Sau3AI 
ScrFI 
BsaJlf 
EcoRII | j 

Bsbl Cjel Acil|| [ (Taql BsmI ||| 

I I III . 

TGGCAACACTCTAACCTCAACCTCCGCGCCAACATCGACACGGAATGCTATCTACCTGGG 



- + 1200 



ACCGTTGTGAGATTGGAGTTGGAGGCGCGGTTGTAGCTGTGCCTTACGATAGATGGACCC 

Bbvl 
CviJI 
Hael 
Haelll 
BsaJI 
Sty I 
CviJI 
Pnu4HI 
Tsel| 

CjePI I I I | |[ Dpnl 
Msel | || | | |t CjePI| 
Aflll| I II I I II Sau3Al|| 

Smll| | || I I II Alwl I)! 

ATCGTCAGCAAAAATTACGAACTTAAGGGGAGCCC^ 
1201 + + + + _. + + 1260 

TAGCAGTCGTTTTTAATGCTTGAATTCCCGTCGGGTTCCGGTTAGATAGATAAAGATACT 

Hpyl88IX 
Maell 
Hpyl88IX | 
Eco57I Alul | j 

Hpyl88IX CviRI [SfaNI CviJI | | 



Tsp509I 
Ppnl Alwl | 



1261 



Mspl 

Bed BsaWl| Hin4I 

1 " I 

TCCGATTGCATCTAACACCACAGGAGCrrTCTVGACGTTCTGACCATCAACCAACCGGATAG 



AGGCTAACGTAGATTGTGGTGTC CTCGAAGTCTGCAAGACTGGTAGTTGGTTGG C CTATC 



•+ 1320 



Bbvl 
PstI 

Alul CviRI | 

Hpyl78III CviJI Sfcl | j 

I I I I I 
CAACTCGCCTrTTAGATTATTCAGGAACGATTGTATTTTCTGGGGAAAAGCTCTCTGCAGA 
1321 + + + + + + 1380 

GTTGAGCGGAAATCTAATAAGTCCTTGCTAACATAAAAGACCCCTTTTCGAGAGACGTCT 



FHU4HI 

Alul | 
CviJI | 

Mwol j 

Mwol Tsel | Msel 

i ii i ill 

TGAAGCGAAAGCTGCTGATAACTTCACATCTATATTAAAGCAACCATTGGCTCTAGCCTC 
1381 + + + _. + + + 1440 

ACTTCGCTTTCGACGACTATTGAAGTGTAGATATAATTTCGTTGGTAACCGAGATCGGAG 



CviJI 
Bfal | 
CviJI | j 
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BpulOI 
Ddel 
Mnll 
NlalV | 
Hpyl78III | j 



TaqI 



1441 



TGGAACCTTAGCACTCAAAGGAAATGTCGAGTTAGATGTCAATGGTTTCACACAGACTGA 

+ + + + + + i50Q 

ACCTTGGAATCGTGAGTTTCCTTTACAGCTCAATCTACAGTTACCAAAGTGTGTCTGACT 



CviJI 
BseRI | 
II 



ScrFI 
Mnll 
EcoRII 
CviRI | 
Nlalll j 
Eco57I I 



Alul 
CviJI 

I 



Alul 
AlwNTI CviJI 



1501 



AGGCTCTACACTCCTCATGCAACCAGGAACAAAGCTCAAAGCAGATACTGAAGCTATCAG 

+ + _ + + + + 1560 

TCCGAGATGTGAGGAGTACGTTGGTCCTTGTTTCGAGTTTCGTCTATGACTTCGATAGTC 



1561 



Dpnl Ddel 
Eco57I Sau3AI | Mnll | 

I I I I I 

TCTTACCAAACTTGTCGTTGATCTTTCTGCCTTAGAGGGAAATAAGAGTGTGTCCATTGA 

+ + + + + + 162Q 

AGAATG G TTTG AACAG CAACT AG AAAG ACGG AAT CTC C CTTT ATT C T CACACAGGTAACT 



1621 



CviJI 

NlalVf BseRI Mnll Mnll 

I I I III 

AACAGCAGGAGCCAACAAAACTATAACTCTAACCTCTCCTCTTGTTTTCCAAGATAGTAG 



Acil 



•+ 1680 



TTGTCGTCCTCGGTTGTTTTGATATTGAGATTGGAGAGGAGAACAAAAGGTTCTATCATC 

Tsp509I CviJI 
Fnu4HI | Fnu4HI | Bbvl 

Tau * I CviJI CviJI Tsell Bbvl I 

ii i i ii II i I 

CGGCAATTTTTATGAAAGCCATACGATAAACCAAGCCTTCACGCAGCCTTTGGTGGTATT 
1681 + + + + + + 1?40 

GCCGTTAAAAATACTTTCGGTATGCTATTTGGTTCGGAAGTGCGTCGGAAACCACCATAA 



TspRI 
Fnu4HI | 
Tsel | j 
Btsl | | | 



CacSI 
Bfal 
Nhel 
Fnu4HI 
Mwol| 
Tsel 



Bbvl | | | |MwoI I 
I I Ml I II 



SfaNI 



Bpml 
Clal | 
TaqI jtihal 
I I 



Rsal 
TatI | 
Bpml BsrI | | 
I 



1741 



CACTGCTGCTACTGCTGCTAGCGATATTTATATCGATGCGCTTCTCACTTCTCCAGTACA 

H H + + H [. 

GTGACGACGATGACGACGATCGCTATAAATATAGCTACGCGAAGAGTGAAGAGGTCATGT 



1800 
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Sthl32I. 
Hpyl78TII | 



Mnll 

BscGI| BsniFI Bgll 

Bsll || CjePI CviJI | Mwol 

I I I M I II I 

AACTCCAGAACCTCATTACGGGTATCAGGGACATTGGGAAGCCACTTGGGCAGACACATC 
1801 + + + - + + + 1Q60 

TTGAGGTCTTGGAGTAATGCCCATAGTCCCTGTAACCCTTCGGTGAACCCGTCTGTGTAG. 



Hpyl78III 
CviRI | 
CjePI | I 



BpulOI 
Ddel 

Maelll CviJI Hpyl78III 

Sthl32l| BscGI iBseMII I 



AACTGCAAAATCAGGAACTATGACTTGGGTAACTACGGGCTACAACCCTAATCCTGAGCG 

1861 + + + + + + 1920 

TTGACGTTTTAGTCCTTGATACTGAACCCATTGATGC C CGATGTTGGGATTAGGACTCG C 

Sthl32I 
Hinf I 
Tf il 
Fokl | 

Alul Hpyl78IIl| j 
CviJI CjePI | j j 



Bce83I 
Sfcl | 



CjePI 
SfaNI | 

II II 
TAGAGCTTCCGTAGTTCCCGATTCATTATGGGCATCCTTTACTGACATTCGCACTCTACA 

1921 + + - + + + + 1980 

ATCTCGAAGGCATCAAGGGCTAAGTAATACCCGTAGGAAATGACTGTAAGCGTGAGATGT 

Nlalll 
Hpyl78III 
Real | 
Dpnl| | 

Sau3AI (III Srall Mnll | Hinf I Hpyl78III 



Plel 
Mnll I Hinf I 



GCAGATCATGACATCTCAAGCGAATAGTATCTATCAGCAACGAGGACTCTGGGCATCAGG 
1981 + + + - + + + 2040 

CGTCTAGTACTGTAGAGTTCGCTTATCATAGATAGTCGTTGCTCCTGAGACCCGTAGTCC 



Apol 
Tsp509I 
Mboll | 
AlwNI | j 

SfaNI | (Xmnl Hpyl78III 

III ( I 
AACTXSCGAATTTCTTCCATAAGGATAAATCAGGAACTAACCAAGCATTCCGAC^TAAAAG 
2041 + + + + + + 2100 

TTGACGCTTAAAGAAGGTATTCCTATTTAGTCCTTGATTGGTTCGTAAGGCTGTATTTTC 



Alul 
CviJI 
Tthlllll | 
Hpyl88IX | j 

Bsml | Mmel j | 



Mmel 

CviJI Mnll | Bcefl 



Mboll 

Mboll | Alul 

Hpyl88IX| j CviJI 

ECOS7I (Il Eco57I TspRl| 



2101 



CTACGGCTATATTGTTGGAGGAAGTGCTGAAGATTTTTCTGAAAATATCTTCAGTGTAGC 



-+ 2160 



GATGCCGATATAACAACCTCCTTCACGACTTCTAAAAAGACTTTTATAGAAGTCACATCG 
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Alul 

cviJi 

CacSI | 
MboII | | 
I I I 



Acelll 
Earl | 
Sapl j 



Hgal 
I 



2161 



TTTCTGCCAGCTCTTCGGTAAAGATAAAGACCTGTTTATAGTTGAAAATACCTCTCATAA 



-+ 2220 



AAAGACGGTCGAGAAGCCATTTCTATTTCTGGACAAATATCAACTTTTATGGAGAGTATT 

Bfal 
Avrll | 
BsaJI | 
Styl| 

CviRI Taqi Mnll | | 
Mnll - Mwol |BspMl|BsmI | || Nlalll 

I II II I I I I | 

CTATTTAGCGTCGCTATACCTGCAACATCGAGCATTCCTAGGAGGACTTCCCATGCCCTC 



2221 



GATAAATCGCAGCGATATGGACGTTGTAGCTCGTAAGGATCCTCCTGAAGGGTACGGGAG 



2280 



Mnll 
HphI | 
BslI | j 



Nlalll 
MslI Nspl 



Mnll 



Alul 
CviJI 
Bpull02I 
Ddel 
Alul | 
CviJI j 
BsmX | | 



2281 



ATTTGGAAGTATCACCGACATGCTGAAAGATATTCCTCTCATTTTGAATGCCCAGCTAAG 



-+ 2340 



TAAACCTTCATAGTGGCTGTACGACTTTCTATAAGGAGAGTAAAACTTACGGGTCGATTC 



Alul 
CviJI 

Sf cl | BciVI 

ill i 

CTACAGCTACACTAAAAATGATATGGATACTCGCTATACTTCCTATCCTGAAGCTCAAGG 



Smll 

Hpyl78III Alul | 
Bce83I | CviJI | CviJI 



2341 



-+ 2400 



GATGTCGATGTGATTTTTACTATACCTATGAGCGATATGAAGGATAGGACTTCGAGTTCC 
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Eco57I 
Avail | 
Sau96I | 
I I 



Dpnl 
BstYI 
Sau3AI 
Hpyl8 8IX 
Banll 



BsiHKAI 
Bspl286I 
Sad 
Alul 
CviJI 
Mnll 
CjePI 
Hpyl78III 
Bfal 
Banll | 
Bspl286I | 
Xbal I 
CviJI | J 
I II 



CviJI 
Alwll 



CjePI 
I 



CTCTTGGACCAATAACTCTGGGGCTCTAGAGCTCGGAGGATCTCTGGCTCTATATCTCCC 
2401 + + + + + + 2460 

GAGAACCTGGTTATTGAGACCCCGAGATCTCGAGCCTCCTAGAGACCGAGATATAGAGGG 



Mboll Taal 



ScrFI 
BsaJI | 
EcoRIl) j 
III 
III 



ScrFI 
Ms el EcoRII | 



Sfcl 
AccI | CviJI 



TAAAGAAGCACCGTTCTTCCAGGGATATTTCCCCTTCTTAAAGTTCCAGGCAGTCTACAG 

2461 - — - + + + + + + 2520 

ATTTCTTCGTGGCAAGAAGGTCCCTATAAAGGGGAAGAATTTCAAGGTCCGTCAGATGTC 



Bed 

Acil Sthl32I Eco57l| 

Fnu4HI Dral Haell BscGI | Bsal | | Bfal 

Taul Msel | Hhal | CviJI | | BsmAI | | BsaXI | 

I II II I I I I II II 

CCGCCAACAAAACTTTAAAGAGAGTGGCGCTGAAGCCCGTGCTTTTGATGATGGAGACCT 



2521 



■+ 2580 



GGCGGTTGTTTTGAAATTTCTCTCACCGCGACTTCGGGCACGAAAACTACTACCTCTGGA 



BsmI Hpyl88IX Mboll 

I I I 

AGTGAACTGCTCTATCCCTGTCGGCATTCGGTTAGAAAAAATCTCCGAAGATGAAAAAAA 



2581 



-+ 2640 



TCACTTGACGAGATAGGGACAGCCGTAAGCCAATCTTl 



Hpyl78III CviJI Sthl32I 

Tsp509I TaqI | Bfal | BsaBI HphI BscGI Thai 

III II I I II 

TAATTTCGAGATTTCTCTAGCCTACATTGGTGATGTGTATCGTAAAAATCCCCGTTCGCG 

2641 + + + + + + 2700 

ATTAAAGCTCTAAAGAGATCGGATGTAACCACTACACATAGCATTTTTAGGGGCAAGCGC 
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CviJI 

NlalVt MnlI 
Rsal TspRl) MnlI 

I Ml 

TACTTCTCTAATGGTCAGTGG AG CCTCTTGGACTTCGCTATGTAAAAACCTCG CACGACA 
2701 + + + + 



Mwol 



^ -i- + + + 2760 

ATGAAGAGATTACCAGTCACCTCGGAGAACCTGAAGCGATACATTTTTGGAGCGTGCTGT 



Hinfl 
Hpyl88IX 
Bed | 

Mwol Plel | | 

CviJI Ddel | CviJI | j | 

I II III 



Bbvl 
MnlI | 
Nlalll | | 

II 



AGCCTTCTTAGCAAGTGCTGGAAGCCATCTGACTCTCTCCCCTCATGTAGAACTCTCTGG 
+ + + + h + 

TCGGAAGAATCGTTCACGACCTTCGGTAGACTGAGAGAGGGGAGTACATCTTGAGAGACC 



Fnu4HI 

Alul| Bpull02I 

CviJI | Alul Ddel RleAI Ddel 

Tselj CviJI CviJI) BseMII | CviJI I 

ii i ii ii i! 

GGAAGCTGCTTATGAGCTTCGTGGCTCAGCACACATCTAC^\ATGTAGATTGTGGGCTAAG 
2821 — — — — ^ h - 4 — 2880 

CCTTCGACGAATACTCGAAGCACCGAGTCGTGTGTAGATGTTACATCTAACACCCGATTC 

Apol 

MnlI EcoRI 

Bfal MnlI) Tsp509I 

I II I 

ATACTCATTCTAGTTCCTACTTTCCTCCCTAAACTITTAGGGAGGAATTCTTATAAAAAC 
2881 + + + + + + 2g40 

TATGAGTAAGATCAAGGATGAAAGGAGGGATTTGAAAATCCCTCCTTAAGAATATTTTTG 

Tsp509I 

Hinfl Bfal Msel | 

Sfcl Tfil Msel Spel| BsmAI MnlI II CviRI 

III M I I || | 
CCTGTAGATTCTTAACTTACTAGTCTCTCCTTTCCTCTTGCTTTCTTTAATTTATTGCAG 
2941 + + + + + + 3000 

GGACATCTAAGAATTGAATGATCAGAGAGGAAAGGAGAACGAAAGAAATTAAATAACGTC 
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Figure 9: Nucleotide and amino acid sequence of CPN 100708 

ataaagtttt ttatatgaac aaaactttga cttcattggc agacacttcg accttaacag 60 

accaattttg ttgtcatccc tataaaaatc aggaattttc atg etc tec tea eta 115 

Met Leu Ser Ser Leu 
1 5 

ate cgt gat tea ttt ccc ctt ctt att tta ctt ccc aca ttc eta gcg 163 
lie Arg Asp Ser Phe Pro Leu Leu lie Leu Leu Pro Thr Phe Leu Ala 
10 15 20 

gca tta gga gee tec gta get ggc ggc gtt atg gga ace tat ate gtt 211 
Ala Leu Gly Ala Ser Val Ala Gly Gly Val Met Gly Thr Tyr lie Val 
25 30 35 

gta aaa cgt att gtt tea att agt gga agt ata tct cat gca att eta 25 9 
Val Lys Arg He Val Ser He Ser Gly Ser He Ser His Ala He Leu 
40 45 50 

gga gga att ggc etc acc eta tgg ata caa tat aag ctt cat etc tct 307 
Gly Gly He Gly Leu Thr Leu Trp He Gin Tyr Lys Leu His Leu Ser 
55 60 65 

ttt ttc cct atg tat gga get att gta gga get att ttt eta get ctt 355 
Phe Phe Pro Met Tyr Gly Ala He Val Gly Ala He Phe Leu Ala Leu 
70 75 80 85 

tgc ate ggc aag ate cac ctg aaa tac caa gaa agg gaa gac tct ttg 403 
Cys He Gly Lys He His Leu Lys Tyr Gin Glu Arg Glu Asp Ser Leu 
90 95 100 

att gcg atg att tgg tct gtg ggc atg gca att gga att ata ttc att 4 51 
He Ala Met He Trp Ser Val Gly Met Ala He Gly He He Phe He 
105 110 lis 

tec agg ctt ccc acc ttt aat gga gag etc ate aat ttt eta ttt ggg 4 99 
Ser Arg Leu Pro Thr Phe Asn Gly Glu Leu He Asn Phe Leu Phe Gly 
120 125 130 

aac att etc tgg gtc acc cct tea gac etc tat age tta" gga ate ttt 547 
Asn He Leu Trp Val Thr Pro Ser Asp Leu Tyr Ser Leu Gly He Phe 
135 140 145 

gat ctt ctt gtt tta gga att gtg gtc ctt tgc cac acc egg ttc ctt 595 
Asp Leu Leu Val Leu Gly He Val Val Leu Cys His Thr Arg Phe Leu 
150 155 160 165 

get ctt tgc tzz gar gac agg tac acg get c-a aac cat tgt tct gta 643 
Ala Leu Cys Phe Asp Glu Arg Tyr Thr Ala Leu Asn His Cys Ser Val 
170 175 180 

cag ctg tgg tat ttc eta ctt ctt gtt ctg aca gca ate acg att gcg 691 
Gin Leu Trp Tyr Phe Leu Leu Leu Val Leu Thr P-.la He Thr He Val 
185 190 1S5 
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atg ttg att tat qtg atg gga acg att ctg atg ctt age atg etc gtc 739 
Met Leu lie Tyr Yal Met Gly Thr lie Leu Met Leu Ser Met Leu Val 
200 205 210 

tta cct gtt get ata gcg tgt aga ttt teg tac aag atg aca cga att 787 * 
Leu Pro Val Ala lie Ala Cys Arg Phe Ser Tyr Lys Met Thr Arg lie 
215 220 225 

atg ttc ate teg gtc etc ttg aat ate tta tgt tct ttt tct gga att 835 
Met Phe lie Ser Val- Leu Leu Asn lie Leu Cys Ser Phe Ser Gly lie 
230 235 240 245 

tgc ate gec tac tgt tta gat ttc cca gta ggt cct acg ata tea ttg 8*83 
Cys lie Ala Tyr Cys Leu Asp Phe Pro Val Gly Pro Thr He Ser Leu 
250 255 260 

ctg atg ggg tta ggt tat aca gcg agt ctt tgt gtg aag aag egg tac 931 
Leu Met Gly Leu Gly Tyr Thr Ala Ser Leu Cys Val Lys Lys Arg Tyr 
265 270 275 

aat ccg teg acg cct tct cct gta agt cct gaa acc aat aca aat gta 979 
Asn Pro Ser Thr Pro Ser Pro Val Ser Pro Glu He Asn Thr Asn Val 
280 285 290 

tagctaggga agegcttttg gaagctttgg aggcattctt cctgttcgtc aggaagaaga 1039 

tcatcaattt tatttaaagc taccagcata tctttctttt caaaatctgg ctgatgagag 1099 

t 1100 
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Figure 10: Restriction enzyme analysis of CPN1 00708 



Bcgl 



TaqI Msel 
i 



ATAAAGTTTTTTATATGAACAAAACTTTGACTTCATTGGCAGACACTTCGACCTTAACAG 
1 + + + + + + go 

TATTTCAAAAAATATACTTGTTTTGAAACTGAAGTAACCGTCTGTGAAGCTGGAATTGTC 



Tsp509I 
Bcgl| 
Fokl | | 

Ml 



BseRI 
Apol | 
Tsp509I | 
Hpyl78III | |NlaIII 
I I I I 



Mnll 



ACC^^TTTTGTTGTCATCCCTATAAAAATCAGGAA.TTTTC^TGCTCTCOTCACTAATCCG 
61 + + + + + + 12Q 

TGGTTAAAACAACAGTAGGGATATTTTTAGTCCTTAAAAGTACGAGAGGAGTGATTAGGC 



BsaXI 
CviJI | 
NlaIV| J 
Mwol | j |MwoI 
I II I 



RleAI 
Fnu4HI 
Taul 

Hinfl Acil| 
Tf il Bf al | | 

I I II 

TGATTCATTTCCCCTTCTTATTTTACTTCCCACATTCCTAGCGGCATTAGGAGCCTCCGT 
121 + + + + + + 180 

ACTAAGTAAAGGGGAAGAATAAAATGAAGGGTGTAAGGATCGCCGTAATCCTCGGAGGCA 

Fnu4HI 
Taul 
Acil 
Cac8I 
Mnll 
Alul | 
CviJI | 

I I 



Maell 



Tsp509I 



NlalV 

I I I 
AGCTGGCGGCGTTATGGGAACCTATATCGTTGTAAAACGTATTGTTTCAATTAGTGGAAG 
181 + + + + + + 240 

TCGACCGCCGCAATACC CTTGGATATAGCAACATTTTG CATAACAAAGTTAATCACCTTC 



Bfal 
Mnll 
Tsp509I | 
CviRI | | 
Nlalllj | 

II 



CviJI 
Hael 

Haelll Alul 
HphI | CviJI 
|Tsp509l| | BciVI Mnll Hindi II 

ii i i i i 

TATATCTCATGCAATTCTAGGAGGAATTGGCCTCACCCTATGGATACAATATAAGCTTCA 
241 + + + + + + 300 

ATATAGAGTACGTTAAGATCCTCCTTAACCGGAGTGGGATACCTATGTTATATTCGAAGT 



BsaXI Alul 
Alul | Alul CviJI 
BslI CviJI | CviJI Bfal I CviRI 
I II | || | 
TCTCTCTTTTTTCCCTATGTATGGAGCTATTGTAGGAGCTATTTTTCTAGCTCTTTGCAT 
301 + + + + + + 360 

AGAGAGAAAAAAGGGATACATACCTCGATAACATCCTCGATAAAAAGATCGAGAAACGTA 



SUBSTITUTE SHEET (RULE 25) 



WO 00/32794 



Fig. 10 (con't) 



Title: CHLAMYDIA ANTIGENS AND 

CORRESPONDING DNA A 
FRAGMENTS AND USES THEIBIf 
Inventor(s): Andrew D. MURDIN et al. 
DOCKET NO.: 032931-0252 



QW% 571 

PCT/CA99/01147 



Dpnl 
BstYI 
Sau3AI 
SfaNI 
Alwl I 



Plel Bbsl 
Bsll| Hinfl iMboII RleAI 
II 



361 



CGGCAAGATCCACCTGAAATACCAAGAAAGGGAAGACTCTTTGATTGCGATGATTTGGTC 

+ + + + : + + 42Q 

GCCGTTCTAGGTGGACTTTATGGTTCTTTCCCTTCTGAGAAACTAACGCTACTAAACCAG 



421 



Banl I 
BsiHKAI 
Bspl286I 
Sad 
Alul | 
CviJI J 
Msel Hin4I j 

' ' I I I I f || 

TGTGGGCATGGCAATTGGAATTATATTCATTTCCAGGCTTCCCACCTTTAATGGAGAGCT 
+ + + + + + 

ACACCCGTACCGTTAACCTTAATATAAGTAAAGGTCCGAAGGGTGGAAATTACCTCTCGA 



Tsp509I 
Muni | 
Tsp509I j 
Nlalll | I 



CviJI 
Cjel | 
ScrFI | 
EcoRII | j 



Cjel 
Cjel | 
Tsp509I | j 

I I I 



BstEII 
Maelll 
HphI SimI 
Eco57I |Tsp45I 
I I 



Hinfl 
Mnll 
BpulOI 
Ddel 
Alul | 
Cjel CviJI j 
Hpyl88IX |SfcI I | |TfiI 



II 



481 



CMC^WITTTTCTATTTGGGAACATTCT 



GTAGTTAAAAGATAAACCCTTGTAAGAGACCCAGTGGGGAAGTCTGGAGATATCGAATCC 



■+ 540 



Sthl32I 
NlalV 

Dpnl Avail Mspl | 

Sau3AI | Cjel Neil j 

MboII | | Tsp509I Sau96I ScrFI I 

III I I | | . 
AATCT1TGATCTTCTTGTTTTAGGAATTGTGGTCCTTTGCCACACCCGGTTCCTTGCTCT 
541 + + + + + + 

TTAGAAACTAGAAGAACAAAATCCTTAACACCAGGAAACGGTGTGGGCCAAGGAACGAGA 



600 



Dral 
Msel | 
Rsal CviJI I 



Alul 
CviJI 
MspAlI 
PvuII 
Rsal 
BsrGI | 
Tat I j 
Bcefl I I 



601 



Mnll 
Cjel | 

II I 

TTGCTTTGATGAGAGGTACACGGCTTTAAACCATTGTTCTGTACAGCTGTGGTATTTCCT 

+ + + + + + 660 

AACGAAACTACTCTCCATGTGCCGAAATTTGGTAACAAGACATGTCGACACCATAAAGGA 
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HpylSSIX 
Hinfl | 

MslI SfaNl | | 

Hpyl88IX Hpyl78III | Bcgl Bed iTfil I 

i i i i iiii 

ACTTCTTGTTCTGACAGCAATCACGATTGTGATGTTGATTTATGTGATGGGAACGATTCT 

661 + + + + + + 720 

TGAAGAACAAGACTGTCGTTAGTGCTAACACTACAACTAAATACACTACCCTTGCTAAGA 

Nlalli 

Nspl 
SphI 

Bpull02I CacSI | 

Ddel MwoI| I Bcgl Sfcl Rsal 
I I! I I I | 
GATGCTTAGCATGCTCGTCTTACCTGTTGCTATAGCGTGTAGATTTTCGTACAAGATGAC 
721 + + + + + + 780 

CTACGAATCGTACGAGCAGAATGGAGAACGATATCGCACATCTAAAAGCATGTTCTACTG 



Taqll 
Tsp509I | 



Hpyl78III 
Avail | 
Sau96I Mnll 



Apol 
Tsp509I 
Hpyl78III | CviRI 



ACGAATTATGTTCATCTCGGTCCTCTTGAATATCTTATGTTCTTTTTCTGGAATTTGCAT 

781 + + + + + + 840 

TGCTTAATACAAGTAGAGCCAGGAGAACTTATAGAATACAAGAAAAAGACCTTAAACGTA 



Taal 
SfaNI | 

I I 



BsrDI 
EcoRV 
Pflll08I 
Avail 
EcoO109I 
PspSII 
Sau96I 
Sse8647I 
Bcgl | 
Bmrl | | 
Bsrl | | J 

111 I 



Bed 



Bcgl 



CGCCTACTGTTTAGATTTCCCAGTAGGTCCTACGATATCATTGCTGATGGGGTTAGGTTA 

841 + + + + + + 900 

GCGGATGACAAATCTAAAGGGTCATCCAGGATGCTATAGTAACGACTACCCCAATCCAAT 



BsaXI 
Hin4I | 
BsaHI 
Hindi 

MbOlI ACCI I 

Rsal | TaqI | 
Hinfl Plel Acil | | Sall|| 

I I I I I III 

TACAGCGAGTCTTTGTGTGAAGAAGCGGTACAATCCGTCGACGCCTTCTCCTGTAAGTCC 

901 + + + + + + 960 

ATGTCGCTCAGAAACACACTTCTTCGCCATGTTAGGCAGCTGCGGAAGAGGACATTCAGG 



Hgal Hpyl78III 
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Haell Alul 
Bf al Hhal | CviJl BsmI 

Alul| Eco47IIl|j Mnll| Mwol | 

CviJlj Mwol | | | Hindi II | j Mboll | | 

I I I Nl IN Ml 

TGAAATCAATACAAATGTATAGCTAGGGAAGCGCTTTTGGAAGCTTTGGAGGCATTCTTC 



ACTTTAGTTATGTTTACATATCGATCCCTTCGCGAAAACCTTCGAAACCTCCGTAAGAAG 



-+ 102 0 



Mboll 
Mboll 
Tsp509I 
Dpnl ) 
Hpyl78III Sau3AI | | 
I III 



Alul 
CviJI 
Dral | 
Msel| | 

II 



1021 



CTGTTCGTCAGGAAGAAGATCATCAATTTTATTTAAAGCTACCAGCATATCTTTCTTTTC 



GACAAGCAGTCCTTCTTCTAGTAGTTAAAATAAATTTCGATGGTCGTATAGAAAGAAAAG 



■+ 1080 



CviJI 



AAAAT CTGGCTGATGAGAGT 

1081 + + 1100 

TTTTAGAC CG ACTACTCTCA 



SUBSTITUTE SHEET (RULE 26) 



WO 00/32794 



Title: CHLAMYDIA ANTIGENS AND to /^fl ^ 7 T 

CORRESPONDING DNA A h~ " " -U -/ f X 

FRAGMENTS AND USES THErBI 
Inventor(s): Andrew D. MURDIN et al. 
DOCKET NO.: 032931-0252 



PCT/CA99/01 147 



Figure 1 1 : Identification of T- and B- cell epitopes from open reading frame ORF* 
cpn!00634 (SEQ ID NO: 11) 

1 MKTSIPWVLV SSVLAFSCHL QSI±ANEELLS PDDSFNGNID . SGTFTPKTSA 

51 TTYSLTGDVF FYEPGKGTPL SDSCFKQTTD NLTFLGNGHS LTFGF I DAGT 

101 HAGAAASTTA NKNLTFSGFS LLSFDSSPST TVTTGQGTLS SAGGVNLENI 

151 RKLWAGNFS TADGGAI KGA SFLLTGTSGD ALFSNNSSST KGGAIATTAG 

201 ARIANNTGYV RFLSNIASTS GGAIDDEGTS ILSNNKFLYF EGNAAKTTGG 

251 AICNTKASGS PELIISNNKT LIFASNVAET SGGAIHAKKL ALS SGGFTEF 

3 01 LRNNVSSATP KGGAISIDAS GELSLSAETG NITFVRNTLT TTGSTDTPKR 

3 51 NAINIGSNGK FTELRAAKNH TIFFYDPITS EGTSSDVLKI NNGSAGALNP 

401 YQGTILFSGE TLTADELKVA DNLKSSFTQP VSLSGGKLLL QKGVTLESTS 

451 FSQEAGSLLG MDSGTTLSTT AGSITITNLG INVDSLGLKQ PVSLTAKGAS 

501 NKVIVSGKUST LIDIEGNIYE SHMFSHDQLF SLLKITVDAD VDTNVDISSL 

551 IPVPAEDPNS EYGFQGQWNV NWTTDTATNT KEATATWTKT GFVPSPERKS 

601 ALVCNTLWGV FTDIRSLQQL VEIGATGMEH KQGFWVSSMT NFLHKTGDEN 

651 RKGFRHTSGG YVIGGSAHTP KDDLFTFAFC HLFARDKDCF IAHNNSRTYG 

701 GTLFFKHSHT LQPQNYLRLG RAKFSESAIE KFPREIPLAL DVQVSFSHSD 

751 NRMETHYTSL PESEGSWSNE CIAGGIGLDL PFVLSNPHPL FKTFIPQMKV 

801 EMVYVSQNSF FESSSDGRGF SIGRLLiNLSI PVGAKFVQGD IGDSYTYDLS 

851 GFFVSDVYRN NPQSTATLVM SPDSWKIRGG NLSRQAFLLR GSNNYVYNSN 
901 CELFGHYAME LRGSSRNYNV DVGTKLRF 



Possible T cell epitope: 
824 RLLMLSIPV 
Possible B cell epitope: 
644 HKTGDENRKGFRHTS 
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Figure 12: Identification of T- and B- ceil epitopes from open reading frame ORF* 
cpn100635 (SEQ ID.NO: 12) 

1 MKSQFSWLVL SSTLACFTSC STVFAATAEN IGPSDSFDGS TNTGTYTPKN 
51 TTTGIDYTLT GDITLQNLGD SAALTKGCFS DTTESLSFAG KGYSLSFLNI 
101 KSSAEGAALS VTTDKNLSLT GFS SLTFLAA PSSVITTPSG KGAVKCGGDL 
151 TFDNNGTILF KQDYCEENGG AISTKNLSLK NSTGSISFEG NKSSATGKKG 
201 GAI CATGTVD ITNNTAPTLF SNNIAEAAGG AINSTGNCTI TGNTSLVFSE 
251 NSVTATAGNG GALSGDADVT ISGNQSVTFS GNQAVANGGA IYAKKLTLAS 
301 GGGGGNPFSN NIVQGTTAGN GGAISILAAG ECSLFSEAGD HYLNGNAIVA 
351 TTPQTTKRNS IDIGSTGKDH ELRAISGHSI FFYDP I TANT AADSTDTLNL 
401 NKADAGNSTD YSGSIVFSGE KLSEDEAKVA DNLTSTLKQP VTLTAGNLVL 
451 KRGVTLDTKG FTQTAGSSVI MDAGTTLKAS TEEVTLTGLS I PVDSLGEGK 
501 KWIAASAAS KNVALSGPIL LLDNQGNAYE NHDLGKTQDF SFVQLSALGT 
551 ATTTDVPAVP TVATPTHYGY QGTWGMTWVD DTASTPKTKT ATLAWTNTGY 
601 LPNPERQGPL VPNSLWGSFS DIQAIQGVIE RSALTLCSDR GFWAAGVANF 
651 LDKDKKGEKR KYRHKSGGYA IGGAAQTCSE NLISFAFCQL FGSDKDFLVA 
701 KNHTDTYAGA FYIQHITECS GFI GCLLDKL PGSWSHKPLV LEGQLAYSHV 
751 SNDLKTKYTA YPEVKGSWGN NAFNMMLGAS SHSYPEYLHC FDTYAPYIKL 
801 NLTYIRQDSF SEKGTEGRSF DDSNLFNLSL PIGVKFEKFS DCNDFSYDLT 
851 LSYVPDLIRN DPKCTTALVI SGASWETYAN NLARQALQVR AGSHYAFSPM 
901 FEVLGQFVFE VRGSSRIYNV DLGGKFQF 



Possible T cell epitope: 
903 VLGQFVFEV 
Possible B cell epitope: 
DKDKKGEKRKYRHKSG 
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Figcre 13: Identification of T- and B- ceil epitopes from open reading frame ORF: 
cpn 100638 (SEQ ID NO: 14) 

1 MKSSLHWFLI SSSLALPLSL NFSAFAAWE INLGPTNSFS GPGTYTPPA^ 
51 TTNADGTIYN LTGDVSITNA GSPTALTASC FKETTGNLSF QGHGYQFIiLQ 
101 NIDAGANCTF TNTAANKLLS FSGFSYLSLI QTTNATTGTG AIKSTGACSI 
151 QSNYSCYFGQ NFSNDNGGAL QGSSISLSLN PNLTFAKNKA TQKGGAL.YST 
2 01 GGITINNTLN SASFSENTAA NNGGA1 YTEA SSFISSNKAI S FINNS VTAT 

2 51 SATGGAIYCS STSAPKPVLT LSDNGELNFI GNTAITSGGA IYTDNLVLSS 

3 01 GGPTLFKNNS GYDTAAPLGG AIAIADSGSL SLSALGGDIT FEGNTWKGA 

3 51 SSSQTTTRNS INIGNTNAKI VQLRASQGNT IYFYDPITTS ITAALSDALN 

4 01 LNGPDLAGNP AYQGTIVFSG EKLSEAEAAE ADNLKSTIQQ PLTLAGGQLS 
4 51 LKSGVTLVAK SFSQSPGSTL LMDAGTTLET ADGITINNLV LNVDSLKETK 
501 KGTLKATQAS QTVTLSGSLS LVDPSGNVYE DVSWNNPQVF SCLTLTADDP 
551 ANIHITDLAA DPLEKNPIHW GYQGNWALSW QEDTATKSKA ATLTWTKTGY 
601 NPNPERRGTL VANTLWGSFV DVRSIQQLVA TKVRQSQETR GIWCEGISNF 
651 FHKDSTKINK GFRHISAGYV VGATTTLASD NLITAAFCQL FGKDRDHF IN 
701 KNRASAYAAS LHLQHLATLS SPSLLRYLPG SESEQPVLFD AQISYIYSKN 
751 TMKTYYTQAP KGESSWYNDG CALELASSLP HTALSHEGLF HAYFPFIKVE 
801 ASYIHQDSFK ERNTTLVRSF DSGDLINVSV PIGITFERFS RNERASYEAT 
851 VI YVADVYRK NPDCTTALL I NNTS WKTTGT NLSRQAGIGR AGIFYAFSPN 
901 LEVTSNLSME IRGSSRSYNA DLGGKFQF 



Possible T cell epitope: 
614 TLWGSFVDV 
Possible B cell epitope: 
5 95 MTKTGYNPNPERRG 



SUBSTITUTE SHEET (RULE 26) 



T' ^ 7 J^jl^II' I:- Title: CHLAMYDIA ANTIGENS A 
WO 00/32794 A CORRESPONDING DNA 

^ FRAGMENTS AND USES THERE 

Inventor(s): Andrew D. MURDIN et al 
: ! DOCKET NO.: 032931-0252 



EOT 



^CT/^A^^I^tT^ 



Figure 14: Identification of T- and B- cell epitopes from open rea Jing frame ORF- 
cpnl 00639 (SEQ ID NO: 15) 

1 MKIPLHKLLI SSTLVTPILL S I ATYGADAS LSPtDSFDGA GGSTFTPKST 
51 ADANGTNYVL. SGNVYINDAG KGTALTGCCF TETTGDLTFT GKGYSFSFNT 
101 VDAGSNAGAA ASTTADKALI FTGFSNLSFI AAPGTTVASG KSTLSSAGAL 
151 NLTDNGTILF SQNVSNEANN NGGAI TTKTL SISGNTSSIT FTSNSAKKLG 
2 01 GAIYSSAAAS ISGNTGQLVF MNNKGETGGG ALGFEASSSI TQNSSLFFSG 
251 NTATDAAGKG GAI YCEKTGE TPTLTISGNK SLTFAENSSV TQGGAICAHG 
301 LDLSAAGPTL FSNNRCGNTA AGKGGAIAIA DSGSLSLSAN QGDITFLGNT 
351 LTSTSAPTST RNAIYLGSSA KITNLRAAQG QSIYFYDPIA SNTTGASDVL 
4 01 TINQPDSNSP LDYSGTIVFS GEKLSADEAK AADNFTSILK QPLALASGTL 
4 51 ALKGNVELDV NGFTQTEGST LLMQPGTKLK ADTEAISLTK LWDLSALEG 
501 NKSVS1ETAG ANKTITLTSP LVFQDSSGNF YESHTINQAF TQPLWFTAA 
551 TAASDIYIDA LLTSPVQTPE PHYGYQGHWE ATWADTSTAK SGTMTWVTTG 
601 YNPNPERRAS WPDSLWASF TDIRTLQQIM TSQANSIYQQ RGL.WASGTAN 
651 FFHKDKSGTN QAFRHKSYGY IVGGSAEDFS ENIFSVAFCQ LFGKDKDLFI 
701 VENTSHNYLA SLYLQHRAFL GGLPMPSFGS ITDMLKDIPL ILNAQLSYSY 
751 TKNDMDTRYT SYPEAQGSWT NNSGALELGG SLALYLPKEA PFFQGYFPFL 
801 KFQAVYSRQQ NFKESGAEAR AFDDGDLVNC SIPVGIRLEK ISEDEKNNFE 
851 ISLAYIGDVY RKNPRSRTSL MVSGASWTSL, CKNLARQAFli ASAGSHLTLS 
901 PHVELSGEAA YELRGSAHIY NVDCGLRYSF 



Possible T cell epitope: 

7 KLLISSTLV 
Possible B cell epitopes: 
839 EKISEDEKNNF 
860 YRKNPRSRT 
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Figure 15: Identification of T- and B- cell epitopes from open reading frame ORF:^ 
cpn100708 (SEQ ID NO: 16) 

1 MLSSLIRDSF PLLILLPTFL AALGASVAGG VMGTYIWKR IVSISGSISH 

51 AILGGIGLTL WIQYKLHLSF FPMYGAIVGA IFLALCIGKI HLKYQEREDS 

101 LIAMIWSVGM AIGIIFISRL PTFNGELINF LFGNILWVTP SDLYSLGIFD 

151 LLVLGIWLC HTRFLALCFD ERYTALNHCS VQLWYFLLLV LTAITIVMLI 

201 YVMGTILMLS MLVLPVAIAC RFSYKMTRIM FISVLLNILC SFSGICIAYC 

251 LDFPVGPTIS LLMGLGYTAS LCVKKRYNPS TPSPVSPEIN TNV 



Possible T cell epitope: 
13 0 FLFGNILWV 
Possible B cell epitope: 
91 HLKYQEREDS 
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<110> Connaught Laboratories Limited 

<120>- Chlamydia antigens and corresponding DNA fragments and uses thereof 

<130> 77813-2 

<140> 

<141> 

<150> US 60/110,427 
<150> US 60/110,438 
<150> US 60/110, 339 
<150> US 60/110,428 
<150> US 60/110,340 
<151> December 1, 1998 
<160> 16 

<170> Patentln Ver. 2.0 



<210> 1 
<211> 2950 
<212> DNA 

<213> Chlamydia pneumoniae 

<220> 

<221> CDS 

<222> (101) (2884) 
<400> 1 

gataaaaatt cttgacagct gttttgtcat ctttaacttg atttacttat tttgtttcta 60 

tattgatgcg aatagttctc taaaaaacaa aagcattacc atg aag act teg att 115 

Met Lys Thr Ser lie 
1 5 

cct tgg gtt tta gtt tec tec gtg tta get ttc tea tgt cac eta cag 163 
Pro Trp Val Leu Val Ser Ser Val Leu Ala Phe Ser Cys His Leu Gin 
10 15 20 

tea eta get aac gag gaa ctt tta tea cct gat gat age ttt aat gga 211 
Ser Leu Ala Asn Glu Glu Leu Leu Ser Pro Asp Asp Ser Phe Asn Gly 
25 30 35 

aat ate gat tea gga acg ttt act cca aaa act tea gee aca.aca tat 259 
Asn lie Asp Ser Gly Thr Phe Thr Pro Lys Thr Ser Ala Thr Thr Tyr 
40 45 50 

tct eta aca gga gat gtc ttc ttt tac gag cct gga aaa ggc act ccc 307 
Ser Leu Thr Gly Asp Val Phe Phe Tyr Glu Pro Gly Lys Gly Thr Pro 
55 60 65 

tta tct gac agt tgt ttt aag caa acc acg gac aat ctt acc ttc ttg 355 
Leu Ser Asp Ser Cys Phe Lys Gin Thr Thr Asp Asn Leu Thr Phe Leu 
7 0 75 80 85 

ggg aac ggt cat age tta acg ttt ggc ttt ata gat get ggc act cat 4 03 
Gly Asn Gly His Ser Leu Thr Phe Gly Phe lie Asp Ala Gly Thr His 
90 95 100 



1 
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gca ggt get get gca tct aca aca gca aat aag aat ctt acc ttc tea 451 
Ala Gly Ala Ala Ala Ser Thr Thr Ala Asn Lys Asn Leu Thr Phe Ser 
105 110 115 

ggg ttt tec tta ctg agt ttt gat tec tct cct age aca acg gtt act 4 99 
Gly Phe Ser Leu Leu Ser Phe Asp Ser Ser Pro Ser Thr Thr Val Thr 
120 125 130 

aca ggt cag gga acg ctt tec tea gca gga ggc gta aat tta gaa aat 547 
Thr Gly Gin Gly Thr Leu Ser Ser Ala Gly Gly Val Asn Leu Glu Asn 
135 140 145 

att cgt aaa ctt gta gtt get ggg aat ttt tct act gca gat ggt gga 595 
lie Arg Lys Leu Val Val Ala Gly Asn Phe Ser Thr Ala Asp Gly Gly 
150 155 160 165 

get ate aaa gga gcg tct ttc ctt tta act ggc act tct gga gat get 643 
Ala lie Lys Gly Ala Ser Phe Leu Leu Thr Gly Thr Ser Gly Asp Ala 
170 175 180 

ctt ttt agt aac aac tct tea tea aca aag gga gga gca att get act 691 
Leu Phe Ser Asn Asn Ser Ser Ser Thr Lys Gly Gly Ala lie Ala Thr 
185 190 195 

aca gca ggc get cgc ata gca aat aac aca ggt tat gtt aga ttc eta 739 
Thr Ala Gly Ala Arg lie Ala Asn Asn Thr Gly Tyr Val Arg Phe Leu 
200 205 210 

tct aac ata gcg tct acg tea gga ggc get ate gat gat gaa ggc acg 787 
Ser Asn lie Ala Ser Thr Ser Gly Gly Ala lie Asp Asp Glu Gly Thr 
215 220 225 

teg ata eta teg aac aac aaa ttt eta tat ttt gaa ggg aat gca gcg 835 
Ser lie Leu Ser Asn Asn Lys Phe Leu Tyr Phe Glu Gly Asn Ala Ala 
230 235 240 245 

aaa act act ggc ggt gcg ate tgc aac acc aag gcg agt gga tct cct 883 
Lys Thr Thr Gly Gly Ala lie Cys Asn Thr Lys Ala Ser Gly Ser Pro 
250 255 260 

gaa ctg ata ate tct aac aat aag act ctg ate ttt get tea aac gta 931 
Glu Leu lie lie Ser Asn Asn Lys Thr Leu lie Phe Ala Ser Asn Val 
265 270 275 

gca gaa aca age ggt ggc gee ate cat get aaa aag eta gee ctt tec 979 
Ala Glu Thr Ser Gly Gly Ala lie His Ala Lys Lys Leu Ala Leu Ser 
280 285 290 

tct gga ggc ttt aca gag ttt eta cga aat aat gtc tea tea gca act 1027 
Ser Gly Gly Phe Thr Glu Phe Leu Arg Asn Asn Val Ser Ser Ala Thr 
295 300 305 

cct aag ggg ggt get ate age ate gat gec tea gga gag etc agt etc 1075 
Pro Lys Gly Gly Ala lie Ser lie Asp Ala Ser Gly Glu Leu Ser Leu 
310 315 320 325 
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tct gca gag aca gga aac att acc ttt gta aga aat acc ctt aca aca 
Ser Ala Glu Thr Gly Asn He Thr Phe Val Arg Asn Thr Leu Thr Thr 
' 330 335 340 



1123 



acc gga agt acc gat act cct aaa cgt aat gcg ate aac ata gga agt 
Thr Gly Ser Thr Asp Thr Pro Lys Arg Asn Ala He Asn He Gly Ser 
345 350 355 



1171 



aac ggg aaa ttc acg gaa tta egg get get aaa aat cat aca att ttc 
Asn Gly Lys Phe Thr Glu Leu Arg Ala Ala Lys Asn His Thr He Phe 
360 365 370 



1219 



ttc tat gat ccc ate act tea gaa gga acc tea tea gac gta ttg aag 
Phe Tyr Asp Pro He Thr Ser Glu Gly Thr Ser Ser Asp Val Leu Lys 
375 380 385 



1267 



ata aat aac ggc tct gcg gga get etc aat cca tat caa gga acg att 
He Asn Asn Gly Ser Ala Gly Ala Leu Asn Pro Tyr Gin Gly Thr He 
390 395 400 405 



1315 



eta ttt tct gga gaa acc eta aca gca gat gaa ctt aaa gtt get gac 
Leu Phe Ser Gly Glu Thr Leu Thr Ala Asp Glu Leu Lys Val Ala Asp 
410 415 420 



1363 



aat tta aaa tct tea ttc acg cag cca gtc tec eta tec gga gga aag 
Asn Leu Lys Ser Ser Phe Thr Gin Pro Val Ser Leu Ser Gly Gly Lys 
425 430 435 



1411 



tta ttg eta caa aag gga gtc act tta gag age acg age ttc tct caa 
Leu Leu Leu Gin Lys Gly Val Thr Leu Glu Ser Thr Ser Phe Ser Gin 
440 445 450 



1459 



gag gec ggt tct etc etc ggc atg gat tea gga acg aca tta tea act 
Glu Ala Gly Ser Leu Leu Gly Met Asp Ser Gly Thr Thr Leu Ser Thr 
455 460 465 



1507 



aca get ggg agt att aca ate acg aac eta gga ate aat gtt gac tec 
Thr Ala Gly Ser He Thr He Thr Asn Leu Gly He Asn Val Asp Ser 
470 475 480 485 



1555 



tta ggt ctt aag cag ccc gtc age eta aca gca aaa ggt get tea aat 
Leu Gly Leu Lys Gin Pro Val Ser Leu Thr Ala Lys Gly Ala Ser Asn 
490 495 500 



1603 



aaa gtg ate gta tct ggg aag etc aac ctg att gat att gaa ggg aac 
Lys Val He Val Ser Gly Lys Leu Asn Leu He Asp He Glu Gly Asn 
505 510 515 



1651 



att tat gaa agt cat atg ttc age cat gac cag etc ttc tct eta tta 
He Tyr Glu.: Ser His Met Phe Ser His Asp Gin Leu Phe Ser Leu Leu 
520 525 530 



1699 



aaa ate acg gtt gat get gat gtt gat act aac gtt gac ate age age 
Lys He Thr Val Asp Ala Asp Val Asp Thr Asn Val Asp He Ser Ser 
535 540 545 



1747 
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ctt ate cct gtt cct get gag gat cct aat tea gaa tac gga ttc caa 
Leu lie Pro Val Pro Ala Glu Asp Pro Asn Ser Glu Tyr* Gly Phe Gin 
550 555 560 565 



1795 



gga caa tgg aat gtt aat tgg act acg gat aca get aca aat aca aaa 
Gly Gin Trp Asn Val Asn Trp Thr Thr Asp Thr Ala Thr Asn Thr Lys 
570 575 580 



1843 



gag gec acg gca act tgg ace aaa aca gga ttt gtt ccc age ccc gaa 
Glu Ala Thr Ala Thr Trp Thr Lys Thr Gly Phe Val Pro Ser Pro Glu 
585 590 595 



1891 



aga aaa tct gcg tta gta tgc aat ace eta tgg gga gtc ttt act gac 
Arg Lys Ser Ala Leu Val Cys Asn Thr Leu Trp Gly Val Phe Thr Asp 
600 605 610 



1939 



att cgc tct ctg caa cag ctt gta gag ate ggc gca act ggt atg gaa 
lie Arg Ser Leu Gin Gin Leu Val Glu lie Gly Ala Thr Gly Met Glu 
615 620 625 



1987 



cac aaa caa ggt ttc tgg gtt tec tec atg acg aac ttc ctg cat aag 
His Lys Gin Gly Phe Trp Val Ser Ser Met Thr Asn Phe Leu His Lys 
630 635 640 645 



2035 



act gga gat gaa aat cgc aaa ggc ttc cgt cat ace tct gga ggc tac 
Thr Gly Asp Glu Asn Arg Lys Gly Phe Arg His Thr Ser Gly Gly Tyr 
650 655 660 



2083 



gtc ate ggt gga agt get cac act cct aaa gac gac eta ttt acc ttt 
Val lie Gly Gly Ser Ala His Thr Pro Lys Asp Asp Leu Phe Thr Phe 
665 670 675 



2131 



gcg ttc tgc cat etc ttt get aga gac aaa gat tgt ttt ate get cac 
Ala Phe Cys His Leu Phe Ala Arg Asp Lys Asp Cys Phe lie Ala His 
680 685 690 



2179 



aac aac tct aga acc tac ggt gga act tta ttc ttc aag cac tct cat 
Asn Asn Ser Arg Thr Tyr Gly Gly Thr Leu Phe Phe Lys His Ser His 
695 700 705 



2227 



acc eta caa ccc caa aac tat ttg aga tta gga aga gca aag ttt tct 
Thr Leu Gin Pro Gin Asn Tyr Leu Arg Leu Gly Arg Ala Lys Phe Ser 
710 715 720 725 



2275 



gaa tea get ata gaa aaa ttc cct agg gaa att ccc eta gee ttg gat 
Glu Ser Ala lie Glu Lys Phe Pro Arg Glu lie Pro Leu Ala Leu Asp 
730 735 740 



2323 



gtc caa gtt teg ttc age cat tea gac aac cgt atg gaa acg cac tat 
Val Gin Val Ser Phe Ser His Ser Asp Asn Arg Met Glu Thr His Tyr 
745 750 755 



2371 



acc tea ttg cca gaa tec gaa ggt tct tgg age aac gag tgt ata get 
Thr Ser Leu Pro Glu Ser Glu Gly Ser Trp Ser Asn Glu Cys lie Ala 
760 765 770 



2419 
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ggt ggt ate ggc eta gac ctt ccz ttt gtt ctt tec aac cca cat cct 24 67 
Gly Gly lie Gly Leu Asp Leu Pro Phe Val Leu Ser Asn Pro His Pro 
775 780 785 

ctt ttc aag ace ttc att cca cag atg aaa gtc gaa atg gtt tat gta 2515 
Leu Phe Lys Thr Phe lie Pro Gin Met Lys Val Glu Met Val Tyr Val 
790 795 800 805 

tea caa aat age ttc ttc gaa age tct agt gat ggc cgt ggt ttt agt 2563 
Ser Gin Asn Ser Phe Phe Glu Ser Ser Ser Asp Gly Arg Gly Phe Ser 
810 815 820 

att gga agg ctg ctt aac etc teg att cct gtg ggt gcg aaa ttc gtg 2611 
lie Gly Arg Leu Leu Asn Leu Ser lie Pro Val Gly Ala Lys Phe Val 
825 830 835 

cag ggg gat ate gga gat tec tac ace tat gat etc tea gga ttc ttt 2659 
Gin Gly Asp lie Gly Asp Ser Tyr Thr Tyr Asp Leu Ser Gly Phe Phe 
840 845 850 

gtt tec gat gtc tat cgt aac aat ccc caa tct aca gcg act ctt gtg 2707 
Val Ser Asp Val Tyr Arg Asn Asn Pro Gin Ser Thr Ala Thr Leu Val 
855 860 865 



atg age cca gac tct tgg aaa att cgc ggt ggc aat ctt tea aga cag 
Met Ser Pro Asp Ser Trp Lys He Arg Gly Gly Asn Leu Ser Arg Gin 
870 875 880 885 



2755 



gca ttt tta ctg agg ggt age aac aac tac gtc tac aac tec aat tgt 2803 

Ala Phe Leu Leu Arg Gly Ser Asn Asn Tyr Val Tyr Asn Ser Asn Cys 
890 895 900 

gag etc ttc gga cat tac get atg gaa etc cgt gga tct tea agg aac 2851 

Glu Leu Phe Gly His Tyr Ala Met Glu Leu Arg Gly Ser Ser Arg Asn 
905 910 915 

tac aat gta gat gtt ggt ace aaa etc cga ttc tagattgeta aaactcccta 2904 

Tyr Asn Val Asp Val Gly Thr Lys Leu Arg Phe 
920 925 



gttcttctag ggagttttct catactttta gggaaatatt tgctat 



2950 



<210> 2 
<211> 2784 
<212> DNA 

<213> Chlamydia pneumoniae 

<220> 

<221> CDS 

<222> (1) . . (2784) 

<400> 2 

atgaagactt cgattccttg ggttttagtt tcctccgtgt tagctttctc atgtcaccta 60 
cagtcactag ctaacgagga acttttatca cctgatgata gctttaatgg aaatatcgat 120 
teaggaaegt ttactccaaa aacttcagcc acaacatatt ctctaacagg agatgtcttc 180 
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ttttacgagc 

aatcttacct 

catgcaggtg 

ttactgagtt 

tcagcaggag 

ctgcagatgg 

tctttttagt 

tcgcatagca 

aggcgctatc 

agggaatgca 

tgaactgata 

cggtggcgcc 

acgaaataat 

agagctcagt 

aaccggaagt 

cacggaatta 

aggaacctca 

tcaaggaacg 

caatttaaaa 

aaagggagtc 

ggattcagga 

caatgttgac 

taaagtgatc 

tcatatgttc 

tgatactaac 

atacggattc 

agaggccacg 

gttagtatgc 

agagatcggc 

cttcctgcat 

cgtcatcggt 

tctctttgct 

ggaactttat 

agagcaaagt 

gatgtccaag 

ccagaatccg 

ccttttgttc 

gaaatggttt 

agtattggaa 

atcggagatt 

aatccccaat 

aatctttcaa 

tgtgagctct 

gatgttggta 



ctggaaaagg 

tcttggggaa 

ctgctgcatc 

ttgattcctc 

gcgtaaattt 

tggagctatc 

aacaactctt 

aataacacag 

gatgatgaag 

gcgaaaacta 

atctctaaca 

atccatgcta 

gtctcatcag 

ctttctgcag 

accgatactc 

cgggctgcta 

tcagacgtat 

attctatttt 

tcttcattca 

actttagaga 

acgacattat 

tccttaggtc 

gtatctggga 

agccatgacc 

gttgacatca 

caaggacaat 

gcaacttgga 

aataccctat 

gcaactggta 

aagactggag 

ggaagtgctc 

agagacaaag 

tcttcaagca 

tttctgaatc 

tttcgttcag 

aaggttcttg 

tttccaaccc 

atgtatcaca 

ggctgcttaa 

cctacaccta 

ctacagcgac 

gacaggcatt 

tcggacatta 

ccaaactccg 



cactccctta 

cggtcatagc 

tacaacagca 

tcctagcaca 

agaaaatatt 

aaaggagcgt 

catcaacaaa 

gttatgttag 

gcacgtcgat 

ctggcggtgc 

ataagactct 

aaaagctagc 

caactcctaa 

agacaggaaa 

ctaaacgtaa 

aaaatcatac 

tgaagataaa 

ctggagaaac 

cgcagccagt 

gcacgagctt 

caactacagc 

ttaagcagcc 

agctcaacct 

agctcttctc 

gcagccttat 

ggaatgttaa 

ccaaaacagg 

ggggagtctt 

tggaacacaa 

atgaaaatcg 

acactcctaa 

attgttttat 

ctctcatacc 

agctatagaa 

ccattcagac 

gagcaacgag 

acatcctctt 

aaatagcttc 

cctctcgatt 

tgatctctca 

tcttgtgatg 

tttactgagg 

cgctatggaa 

attc 



tctgacagnt 

ttaacgtttg 

aataagaatc 

acggttacta 

cgtaaacttg 

ctttccttta 

gggaggagca 

attcctatct 

actatcgaac 

gatctgcaac 

gatctttgct 

cctttcctct 

ggggggtgct 

cattaccttt 

tgcgatcaac 

aattttcttc 

taacggctct 

cctaacagca 

ctccctatcc 

ctctcaagag 

tgggagtatt 

cgtcagccta 

gattgatatt 

tctattaaaa 

ccctgttcct 

ttggactacg 

atttgttccc 

tactgacatt 

acaaggtttc 

caaaggcttc 

agacgaccta 

atcgctcaca 

ctacaacccc 

aaattcccta 

aaccgtatgg 

tgtatagctg 

ttcaagacct 

ttcgaaagct 

cctgtgggtg 

ggattctttg 

agcccagact 

ggtagcaaca 

ctccgtggat 



gttttaagca 

gctttataga 

ttaccttctc 

caggtcaggg 

tagttgctgg 

actggcactt 

attgctacta 

aacatagcgt 

aacaaatttc 

accaaggcga 

tcaaacgtag 

ggaggcttta 

atcagcatcg 

gtaagaaata 

ataggaagta 

tatgatccca 

gcgggagctc 

gatgaactta 

ggaggaaagt 

gccggttctc 

acaatcacga 

acagcaaaag 

gaagggaaca 

atcacggttg 

gctgaggatc 

gatacagcta 

agccccgaaa 

cgctctctgc 

tgggtttcct 

cgtcatacct 

tttacctttg 

acaactctag 

aaaactattt 

gggaaattcc 

aaacgcacta 

gtggtatcgg 

tcattccaca 

ctagtgatgg 

cgaaattcgt 

tttccgatgt 

cttggaaaat 

actacgtcta 

cttcaaggaa 



aaccacggac 
tgctggcact 

agggttttcc 

aacgcttt'cc 

gaatttttca 

ctggagatgc 

cagcaggcgc 

ctacgtcagg 

tatattttga 

gtggatctcc 

cagaaacaag 

cagagtttct 

atgcctcagg 

cccttacaac 

acgggaaatt 

tcacttcaga 

tcaatccata 

aagttgctga 

tattgctaca 

tcctcggcat 

acctaggaat 

gtgcttcaaa 

tttatgaaag 

atgctgatgt 

ctaattcaga 

caaatacaaa 

gaaaatctgc 

aacagcttgt 

ccatgacgaa 

ctggaggcta 

cgttctgcca 

aacctacggt 

gagattagga 

cctagccttg 

tacctcattg 

cctagacctt 

gatgaaagtc 

ccgtggtttt 

gcagggggat 

ctatcgtaac 

tcgcggtggc 

caactccaat 

ctacaatgta 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2784 
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agttaaagat gacaaaacag ctgtcaagaa tttttatctt gactctctga gttttctatt 60 

ttatatgacg caagtaagaa tttaataata aagtgggttt atg aaa teg caa ttt 115 

Met Lys Ser Gin Phe 
1 5 

tec tgg tta gtg etc tct teg aca ttg gca tgt ttt act agt tgt tec 163 
Ser Trp Leu Val Leu Ser Ser Thr Leu Ala Cys Phe Thr Ser Cys Ser 
10 15 20 

act gtt ttt get gca act get gaa aat ata ggc ccc tct gat age ttt 211 
Thr Val Phe Ala Ala Thr Ala Glu Asn lie Gly Pro Ser Asp Ser Phe 
25 30 35 

gac gga agt act aac aca ggc acc tat act cct aaa aat acg act act 259 
Asp Gly Ser Thr Asn Thr Gly Thr Tyr Thr Pro Lys Asn Thr Thr Thr 
40 45 50 

gga ata gac tat act ctg aca gga gat ata act ctg caa aac ctt ggg 307 
Gly lie Asp Tyr Thr Leu Thr Gly Asp lie Thr Leu Gin Asn Leu Gly 
55 60 65 

gat teg gca get tta acg aag ggt tgt ttt tct gac act acg gaa tct 355 
Asp Ser Ala Ala Leu Thr Lys Gly Cys Phe Ser Asp Thr Thr Glu Ser 
70 75 80 85 

tta age ttt gee ggt aag ggg tac tea ctt tct ttt tta aat att aag 403 
Leu Ser Phe Ala Gly Lys Gly Tyr Ser Leu Ser Phe Leu Asn lie Lys 
90 95 100 

tct agt get gaa ggc gca gec ctt tct gtt aca act gat aaa aat ctg 451 
Ser Ser Ala Glu Gly Ala Ala Leu Ser Val Thr Thr Asp Lys Asn Leu 
105 110 115 

teg eta aca gga ttt teg agt ctt act ttc tta gcg gec cca tea teg 4 99 
Ser Leu Thr Gly Phe Ser Ser Leu Thr Phe Leu Ala Ala Pro Ser Ser 
120 125 130 

gta ate aca acc ccc tea gga aaa ggt gca gtt aaa tgt gga ggg gat 547 
Val lie Thr Thr Pro Ser Gly Lys Gly Ala Val Lys Cys Gly Gly Asp 
135 140 145 

ctt aca ttt gat aac aat gga act att tta ttt aaa caa gat tac tgt 595 
Leu Thr Phe Asp Asn Asn Gly Thr lie Leu Phe Lys Gin Asp Tyr Cys 
150 155 160 165 

gag gaa aat ggc gga gee att tct acc aag aat ctt tct ttg aaa aac 643 
Glu Glu Asn Gly Gly Ala lie Ser Thr Lys Asn Leu Ser Leu Lys Asn 
170 175 180 

age acg gga teg att tct ttt gaa ggg aat aaa teg age gca aca ggg 691 
Ser Thr Gly Ser lie Ser Phe Glu Gly Asn Lys Ser Ser Ala Thr Gly 
185 190 195 

aaa aaa ggt ggg get att tgt get act ggt act gta gat att aca aat 739 
Lys Lys Gly Gly Ala lie Cys Ala Thr Gly Thr Val Asp lie Thr Asn 
200 205 210 
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aat acg get cct acc etc ttc teg aac aat att get gaa get gca ggt 787 
Asn Thr Ala Pro Thr Leu Phe Ser Asn Asn lie Ala Glu Ala Ala Gly 
215 220 225 

gga get ata aat age aca gga aac tgt aca att aca ggg aat acg tct 835 
Gly Ala lie Asn Ser Thr Gly Asn Cys Thr lie Thr Gly Asn Thr Ser 
230 235 240 245 

ctt gta ttt tct gaa aat agt gtg aca gcg acc gca gga aat gga gga 883 
Leu Val Phe Ser Glu Asn Ser Val Thr Ala Thr Ala Gly Asn Gly Gly 
250 255 260 

get ctt tct gga gat gee gat gtt acc ata tct ggg aat cag agt gta 931 
Ala Leu Ser Gly Asp Ala Asp Val Thr lie Ser Gly Asn Gin Ser Val 
265 270 275 

act ttc tea gga aac caa get gta get aat ggc gga gee att tat get 979 
Thr Phe Ser Gly Asn Gin Ala Val Ala Asn Gly Gly Ala lie Tyr Ala 
280 285 290 

aag aag ctt aca ctg get tec ggg ggg ggg ggg ggg aat ccc ttt tct 1027 
Lys Lys Leu Thr Leu Ala Ser Gly Gly Gly Gly Gly Asn Pro Phe Ser 
295 300 305 

aac aat ata gtc caa ggt acc act gca ggt aat ggt gga gee att tct 1075 
Asn Asn lie Val Gin Gly Thr Thr Ala Gly Asn Gly Gly Ala lie Ser 
310 315 320 325 

ata ctg gca get gga gag tgt agt ctt ttc age gaa gca ggg gac cat 1123 
lie Leu Ala Ala Gly Glu Cys Ser Leu Phe Ser Glu Ala Gly Asp His 
330 335 340 



tac ctt aat ggg aat gec att gtt gca act aca cca caa act aca aaa 
Tyr Leu Asn Gly Asn Ala lie Val Ala Thr Thr Pro Gin Thr Thr Lys 
345 350 355 



1171 



aga aat tct att gac ata gga tct act ggc aaa gat cac gaa tta cgt 1219 
Arg Asn Ser lie Asp lie Gly Ser Thr Gly Lys Asp His Glu Leu Arg 
360 365 370 

gca ata tct ggg cat age ate ttt ttc tac gat ccg att act get aat 1267 
Ala lie Ser Gly His Ser He Phe Phe Tyr Asp Pro He Thr Ala Asn 
375 380 385 

acg get gcg gat tct aca gat act tta aat etc aat aag get gat gca 1315 
Thr Ala Ala Asp Ser Thr Asp Thr Leu Asn Leu Asn Lys Ala Asp Ala 
390 395 400 405 

ggt aat agt aca gat tat agt ggg teg att gtt ttt tct ggt gaa aag 1363 
Gly Asn Ser Thr Asp Tyr Ser Gly Ser He Val Phe Ser Gly Glu Lys 
410 415 420 

etc tct gaa gat gaa gca aaa gtt gca gac aac etc act tct acg ctg 1411 
Leu Ser Glu Asp Glu Ala Lys Val Ala Asp Asn Leu Thr Ser Thr Leu 
425 430 435 
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aag cag cct ' gta act eta act gca gga aat tta gta ctt aaa cgt ggt 14 59 
Lys Gin Pro Val Thr Leu Thr Ala Gly Asn Leu Val Leu Lys Arg Gly 
440 445 450 

gtc act etc gat acg aaa ggc ttt act cag acc gcg ggt tec tct gtt 1507 
Val Thr Leu Asp Thr Lys Gly Phe Thr Gin Thr Ala Gly Ser Ser Val 
455 460 .465 

att atg gat gcg ggc aca acg tta aaa gca agt aca gag gag gtc act 1555 
lie Met Asp Ala Gly Thr Thr Leu Lys Ala Ser Thr Glu Glu Val Thr 
470 475 480 485 

tta aca ggt ctt tec att cct gta gac tct tta ggc gag ggt aag aaa 1603 
Leu Thr Gly Leu Ser lie Pro Val Asp Ser Leu Gly Glu Gly Lys Lys 
490 495 500 

gtt gta att get get tct gca gca agt aaa aat gta gec ctt agt ggt 1651 
Val Val lie Ala Ala Ser Ala Ala Ser Lys Asn Val Ala Leu Ser Gly 
505 510 515 

ccg att ctt ctt ttg gat aac caa ggg aat get tat gaa aat cac gac 1699 
Pro lie Leu Leu Leu Asp Asn Gin Gly Asn Ala Tyr Glu Asn His Asp 
520 525 530 

tta gga aaa act caa gac ttt tea ttt gtg cag etc tct get ctg ggt 1747 
Leu Gly Lys Thr Gin Asp Phe Ser Phe Val Gin Leu Ser Ala Leu Gly 
535 540 545 

act gca aca act aca gat gtt cca gcg gtt cct aca gta gca act cct 17 95 
Thr Ala Thr Thr Thr Asp Val Pro Ala Val Pro Thr Val Ala Thr Pro 
550 555 560 565 

acg cac tat ggg tat caa ggt act tgg gga atg act tgg gtt gat gat 1843 
Thr His Tyr Gly Tyr Gin Gly Thr Trp Gly Met Thr Trp Val Asp Asp 
570 575 580 

acc gca age act cca aag act aag aca gcg aca tta get tgg acc aat 1891 
Thr Ala Ser Thr Pro Lys Thr Lys Thr Ala Thr Leu Ala Trp Thr Asn 
585 590 595 

aca ggc tac ctt ccg aat cct gag cgt caa gga cct tta gtt cct aat 1939 
Thr Gly Tyr Leu Pro Asn Pro Glu Arg Gin Gly Pro Leu Val Pro Asn 
600 605 610 

age ctt tgg gga tct ttt tea gac ate caa gcg att caa ggt gtc ata 1987 
Ser Leu Trp Gly Ser Phe Ser Asp lie Gin Ala lie Gin Gly Val He 
615 620 625 

gag aga agt get ttg act ctt tgt tea gat cga ggc ttc tgg get gcg 2035 
Glu Arg Ser Ala Leu Thr Leu Cys Ser Asp Arg Gly Phe Trp Ala Ala 
-630 635 640 645 

gga gtc gee aat ttc tta gat aaa gat aag aaa ggg gaa aaa cgc aaa 2083 
Gly Val Ala Asn Phe Leu Asp Lys Asp Lys Lys Gly Glu Lys Arg Lys 
650 655 660 
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tac cgt cat aaa tct ggt gga tat get ate gga ggt gca gcg caa act 2131 
Tyr Arg His Lys Ser Gly Gly Tyr Ala lie Gly Gly Ala Ala Gin Thr 
665 670 675 

tgt tct gaa aac tta att age ttt gec ttt tgc caa etc ttt ggt age 2179 
Cys Ser Glu Asn Leu lie Ser Phe Ala Phe Cys Gin Leu Phe Gly Ser 
680 685 690 

gat aaa gat ttc tta gtc get aaa aat cat act gat ace tat gca gga 2227 
Asp Lys Asp Phe Leu Val Ala Lys Asn His Thr Asp Thr Tyr Ala Gly 
695 700 705 

gec ttc tat ate caa cac att aca gaa tgt agt ggg ttc ata ggt tgt 2275 
Ala Phe Tyr lie Gin His lie Thr Glu Cys Ser Gly Phe lie Gly Cys 
710 715 720 725 

etc tta gat aaa ctt cct ggc tct tgg agt cat aaa ccc etc gtt tta 2323 
Leu Leu Asp Lys Leu Pro Gly Ser Trp Ser His Lys Pro Leu Val Leu 
730 735 740 

gaa ggg cag etc get tat age cac gtc agt aat gat ctg aag aca aag 2371 
Glu Gly Gin Leu Ala Tyr Ser His Val Ser Asn Asp Leu Lys Thr Lys 
745 750 755 

tat act gcg tat cct gag gtg aaa ggt tct tgg ggg aat aat get ttt 2419 
Tyr Thr Ala Tyr Pro Glu Val Lys Gly Ser Trp Gly Asn Asn Ala Phe 
760 765 770 

aac atg atg ttg gga get tct tct cat tct tat cct gaa tac ctg cat 24 67 
Asn Met Met Leu Gly Ala Ser Ser His Ser Tyr Pro Glu Tyr Leu His 
775 780 785 



tgt ttt gat acc tat get cca tac ate aaa ctg aat ctg ace tat ata 
Cys Phe Asp Thr Tyr Ala Pro Tyr lie Lys Leu Asn Leu Thr Tyr lie 
790 795 800 805 



2515 



cgt cag gac age ttc teg gag aaa ggt aca gaa gga aga tct ttt gat 2563 
Arg Gin Asp Ser Phe Ser Glu Lys Gly Thr Glu Gly Arg Ser Phe Asp 
810 815 820 

gac age aac etc ttc aat tta tct ttg cct ata ggg gtg aag ttt gag 2611 
Asp Ser Asn Leu Phe Asn Leu Ser Leu Pro lie Gly Val Lys Phe Glu 
825 830 835 

aag ttc tct gat tgt aat gac ttt tct tat gat ctg act tta tec tat 2659 
Lys Phe Ser Asp Cys Asn Asp Phe Ser Tyr Asp Leu Thr Leu Ser Tyr 
840 845 850 

gtt cct gat ctt ate cgc aat gat ccc aaa tgc act aca gca ctt gta 2707 
Val Pro Asp Leu lie Arg Asn Asp Pro Lys Cys Thr Thr Ala Leu Val 
855 860 865 

ate age gga gee tct tgg gaa act tat gee aat aac tta gca cga cag 2755 
lie Ser Gly Ala Ser Trp Glu Thr Tyr Ala Asn Asn Leu Ala Arg Gin 
870 875 880 885 
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gcc ttg caa gtg cgt gca ggc agt cac tac gcc ttc tct cct atg ttt 
Ala Leu Gin Val Arg Ala Gly Ser . His Tyr Ala Phe Ser Pro Met Phe 
; i90 895 900 



2803 



gaa gtg etc ggc cag ttt gtc ttt gaa gtt cgt gga tec tea egg att 
Glu Val Leu Gly Gin Phe Val Phe Glu Val Arg Gly Ser Ser Arg lie 
905 910. 915 



2851 



tat aat gta gat ctt ggg ggt aag ttc caa ttc taggagegtc tctcatgtct 2904 
Tyr Asn Val Asp Leu Gly Gly Lys Phe Gin Phe 
920 925 



cagaaattct gagagagatc gcatttagga ttttcttaaa cacgac 



2950 



<210>4 

<211>2950 

<212>DNA 

<2 13>Chlamydiapneumoniae 

<220> 

<221>CDS 

<222>{101) . . (2784) 
<400>4 

atgaaatege aattttcctg gttagtgctc tcttcgacat tggcatgttt tactagttgt 60 

tccactgttt ttgetgeaac tgctgaaaat ataggcccct ctgatagctt tgacggaagt 120 

actaacacag gcacctatac tcctaaaaat acgactactg gaatagacta tactctgaca 180 

ggagatataa etctgeaaaa ccttggggat teggcagett taacgaaggg ttgtttttct 240 

gaeactaegg aatctttaag etttgeeggt aaggggtact cactttcttt tttaaatatt 300 

aagtctagtg ctgaaggege agccctttct gttacaactg ataaaaatct gtegctaaca 360 

ggattttcga gtcttacttt cttagcggcc ccatcatcgg taatcacaac cccctcagga 420 

aaaggtgcag ttaaatgtgg aggggatctt acatttgata acaatggaac tattttattt 480 

aaacaagatt actgtgagga aaatggcgga gecatttcta ccaagaatct ttctttgaaa 540 

aacagcaegg gatcgatttc ttttgaaggg aataaatcga gegcaacagg gaaaaaaggt 600 

ggggctattt gtgctactgg tactgtagat attacaaata atacggctcc taccctcttc 660 

tcgaacaata ttgetgaage tgcaggtgga gctataaata gcacaggaaa ctgtacaatt 720 

acagggaata cgtctcttgt attttctgaa aatagtgtga cagcgaccgc aggaaatgga 780 

ggagctcttt ctggagatgc cgatgttacc atatctggga atcagagtgt aactttctca 84 0 

ggaaaccaag ctgtagctaa tggeggagee atttatgeta agaagcttac actggcttcc 900 

gggggggggg gggggaatcc cttttctaac aatatagtcc aaggtaccac tgcaggtaat 960 

ggtggagcca tttctatact ggcagctgga gagtgtagtc ttttcagega agcaggggac 1020 

cattacctta atgggaatgc cattgttgca actacaccac aaactacaaa aagaaattca 1080 

ttgacatagg atctactggc aaagatcacg attaegtgea atatctgggc atagcatctt 1140 

tttctacgat ccgattactg etaataegge tgeggattet acagatactt taaatctcaa 1200 

taaggctgat gcaggtaata gtacagatta tagtgggtcg attgtttttt ctggtgaaaa 1260 

gctctctgaa gatgaagcaa aagttgcaga caacctcact tetaegctga ageagectgt 1320 

aactctaact gcaggaaatt tagtacttaa acgtggtgtc actctcgata cgaaaggctt 1380 

tactcagacc gcgggttcct ctgttattat ggatgeggge acaacgttaa aagcaagtac 14 4 0 

agaggaggtc actttaacag gtctttccat tcctgtagac tetttaggeg agggtaagaa 1500 

agttgtaatt getgettctg cagcaagtaa aaatgtagcc cttagtggtc cgattcttct 1560 

tttggataac caagggaatg cttatgaaaa tcacgactta ggaaaaactc aagacttttc 1620 

atttgtgcag ctctctgctc tgggtactgc aacaactaca gatgttccag cggttcctac 1680 

agtagcaact cctacgcact atgggtatca aggtacttgg ggaatgactt gggttgatga 1740 

taccgcaagc actccaaaga ctaagacagc gacattagct tggaccaatc aggctacctt 1800 

ccgaatcctg agegtcaagg acctttagtt cctaatagcc tttggggatc tttttcagac 18 60 
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atccaagcga 
ttctgggctg 
taccgtcata 
ttaattagct 
aatcatactg 
ttcataggtt 
gaagggcagc 
cctgaggtga 
cattcttatc 
ctgacctata 
acagcaacct 
gtaatgactt 
ccaaatgcac 
aacttagcac 
tttgaagtgc 
gatcttgggg 



ttcaaggtgt 
cgggagtcgc 
aatctggtgg 
ttgccttttg 
atacctatgc 
gtctcttaga 
tcgcttatag 
aaggttcttg 
ctgaatacct 
tacgtcagga 
cttcaattta 
ttcttatgat 
tacagcactt 
gacaggcctt 
tcggccagtt 
gtaagttcca 



catagagaga 
caatttctta 
atatgctatc 
ccaactcttt 
aggagccttc 
taaacttcct 
ccacgtcagt 
ggggaataat 
gcattgtttt 
cagcttctcg 
tctttgccta 
ctgactttat 
gtaatcagcg 
gcaagtgcgt 
tgtctttgaa 
attc 



agtgctttga 
gataaagata 
ggaggtgcag 
ggtagcgata 
tatatccaac 
ggctcttgga 
aatgatctga 
gcttttaaca 
gatacctatg 
gagaaagtac 
taggggtgaa 
cctatgttcc 
gagcccactt 
gcaggcagtc 
gttcgtggat 



ctctttgtt'c 
agaaagggga 
cgcaaacttg 
aagatttctt 
acattacaga 
gtcataaacc 
agacaaagta 
tgatgttggg 
ctccatacat 
agaaggaaga 
gtttgagaag 
tgatcttatc 
cttgggaaac 
actacgcctt 
cctcacggat 



agatcgaggc 
aaaacgcaaa 
ttctgaaaac 
agtcgctaaa 
atgtagtggg 
cctcgtttta 
tactgcgtat 
agcttcttct 
caaactgaat 
tcttttgatg 
ttctctgatt 
cgcaatgatc 
ttatgccaat 
ctctcctatg 
ttataatgta 



19^0 
1980 
204C 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2784 



<210> 5 
<211> 2950 
<212> DNA 

<213> Chlamydia pneumoniae 

<220> 

<221> CDS 

<222> (101) . . (2884) 
<400> 5 

tgtagattct taacttacta gtctctcctt tcctcttgct ttctttaatt tattgcagta 60 

tgtggtgaaa taatttgtta aaccacctat agccctctac atg aaa tec tct ctt 115 

Met Lys Ser Ser Leu 
1 5 

cat tgg ttt tta ate teg tea tct tta gca ctt ccc ttg tea eta aat 163 
His Trp Phe Leu lie Ser Ser Ser Leu Ala Leu Pro Leu Ser Leu Asn 
10 15 20 

ttc tct gcg ttt get get gtt gtt gaa ate aat eta gga cct ace aat 211 
Phe Ser Ala Phe Ala Ala Val Val Glu lie Asn Leu Gly Pro Thr Asn 
25 30 35 

age ttc tct gga cca gga ace tac act cct cca gec caa aca aca aat 259 
Ser Phe Ser Gly Pro Gly Thr Tyr Thr Pro Pro Ala Gin Thr Thr Asn 
40 45 50 

gca gat gga act ate tat aat eta aca ggg gat gtc tea ate acc aat 307 
Ala Asp Gly Thr He Tyr Asn Leu Thr Gly Asp Val Ser He Thr Asn 
55 60 65 

gca gga tct ccg aca get eta acc get tec tgc ttt aaa gaa act act 355 
Ala Gly Ser Pro Thr Ala Leu Thr Ala Ser Cys Phe Lys Glu Thr Thr 
70 75 80 85 
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ggg aat ctt tct ttc caa ggc cac ggc tac caa ttt etc eta caa aat 403 
Gly Asn Leu Ser Phe Gin Gly His Gly Tyr Gin Phe Leu Leu Gin Asn 
90 95 100 

ate gat gcg gga gcg aac tgt acc ttt acc aat aca get gca aat aag 451 
lie Asp Ala Gly Ala Asn Cys Thr Phe Thr Asn Thr Ala Ala Asn Lys 
105 110 115 

ctt etc tec ttt tea gga ttc tec tat ttg tea eta ata caa acc acg 4 99 
Leu Leu Ser Phe Ser Gly Phe Ser Tyr Leu Ser Leu lie Gin Thr Thr 
120 125 130 

aat get acc aca gga aca gga gee ate aag tec aca gga get tgt tct 547 
Asn Ala Thr Thr Gly Thr Gly Ala lie Lys Ser Thr Gly Ala Cys Ser 
135 140 145 

att cag teg aac tat agt tgc tac ttt ggc caa aac ttt tct aat gac 595 
lie Gin Ser Asn Tyr Ser Cys Tyr Phe Gly Gin Asn Phe Ser Asn Asp 
150 155 160 165 

aat gga ggc gec etc caa ggc age tct ate agt eta teg eta aac ccc 643 
Asn Gly Gly Ala Leu Gin Gly Ser Ser lie Ser Leu Ser Leu Asn Pro 
170 175 180 

aac eta acg ttt gec aaa aac aaa gca acg caa aaa ggg ggt gee etc 691 
Asn Leu Thr Phe Ala Lys Asn Lys Ala Thr Gin Lys Gly Gly Ala Leu 
185 190 195 

tat tec acg gga ggg att aca att aac aat acg tta aac tea gca tea 739 
Tyr Ser Thr Gly Gly lie Thr lie Asn Asn Thr Leu Asn Ser Ala Ser 
200 205 210 

ttt tct gaa aat acc gcg gcg aac aat ggc gga gee att tac acg gaa 787 
Phe Ser Glu Asn Thr Ala Ala Asn Asn Gly Gly Ala lie Tyr Thr Glu 
215 220 225 

get age agt ttt att age age aac aaa gca att age ttt ata aac aat 835 
Ala Ser Ser Phe lie Ser Ser Asn Lys Ala lie Ser Phe lie Asn Asn 
230 235 240 245 

agt gtg acc gca acc tea get aca ggg gga gec att tac tgt agt agt 883 
Ser Val Thr Ala Thr Ser Ala Thr Gly Gly Ala lie Tyr Cys Ser Ser 
250 255 260 

aca tea gec ccc aaa cca gtc tta act eta tea gac aac ggg gaa ctg 931 
Thr Ser Ala Pro Lys Pro Val Leu Thr Leu Ser Asp Asn Gly Glu Leu 
265 270 275 

aac ttt ata gga aat aca gca att act agt ggt ggg gcg att tat act 97 9 
Asn Phe lie Gly Asn Thr Ala lie Thr Ser Gly Gly Ala lie Tyr Thr 
280 285 290 

gac aat eta gtt ctt tct tct gga gga cct acg ctt ttt aaa aac aac 1027 
Asp Asn Leu Val Leu Ser Ser Gly Gly Pro Thr Leu Phe Lys Asn Asn 
295 300 305 
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tct ggc tat gat act. gca get ccc tta gga gga gca att gcg att ' get 1075 
Ser Gly Tyr Asp Thr Ala Ala Pro Leu Gly Gly Ala lie Ala lie Ala 
310 315 320 325 

gac tct gga tct ttg agt ctt teg get ctt ggt gga gac ate act ttt 1123 
Asp Ser Gly Ser Leu Ser Leu Ser Ala Leu Gly Gly Asp lie Thr Phe 
330 335 340 

gaa gga aac aca gta gtc aaa gga get tct teg agt cag ace act ace 1171 
Glu Gly Asn Thr Val Val Lys Gly Ala Ser Ser Ser Gin Thr Thr Thr 
345 350 355 

aga aat tct att aac ate gga aac acc aat get aag att gta cag ctg 1219 
Arg Asn Ser lie Asn lie Gly Asn Thr Asn Ala Lys lie Val Gin Leu 
360 365 370 

cga gec tct caa ggc aat act ate tac ttc tat gat cct ata aca act 1267 
Arg Ala Ser Gin Gly Asn Thr lie Tyr Phe Tyr Asp Pro lie Thr Thr 
375 380 385 

age ate act gca get etc tea gat get eta aac tta aat ggt cct gac 1315 
Ser lie Thr Ala Ala Leu Ser Asp Ala Leu Asn Leu Asn Gly Pro Asp 
390 395 400 405 

ctt gca ggg aat cct gca tat caa gga acc ate gta ttt tct gga gag 1363 
Leu Ala Gly Asn Pro Ala Tyr Gin Gly Thr lie Val Phe Ser Gly Glu 
410 415 420 

aag etc teg gaa gca gaa get gca gaa get gat aat etc aaa tct aca 1411 
Lys Leu Ser Glu Ala Glu Ala Ala Glu Ala Asp Asn Leu Lys Ser Thr 
425 430 435 

att cag caa cct eta act ctt gcg gga ggg caa etc tct ctt aaa tea 1459 
lie Gin Gin Pro Leu Thr Leu Ala Gly Gly Gin Leu Ser Leu Lys Ser 
440 445 450 

gga gtc act eta gtt get aag tec ttt teg caa tct ccg ggc tct acc 1507 
Gly Val Thr Leu Val Ala Lys Ser Phe Ser Gin Ser Pro Gly Ser Thr 
455 . 460 465 

etc etc atg gat gca ggg acc aca tta gaa acc get gat ggg ate act 1555 
Leu Leu Met Asp Ala Gly Thr Thr Leu Glu Thr Ala Asp Gly lie Thr 
470 475 480 485 

ate aat aat ctt gtt etc aat gta gat tec tta aaa gag acc aag aag 1603 
He Asn Asn Leu Val Leu Asn Val Asp Ser Leu Lys Glu Thr Lys Lys 
490 495 500 

ggc acg eta aaa gca aca caa gca agt cag aca gtc act tta tct gga 1651 
Gly Thr Leu Lys Ala Thr Gin Ala Ser Gin Thr Val Thr Leu Ser Gly 
505 510 515 

teg etc tct ctt gta gat cct tct gga aat gtc tac gaa gat gtc tct 1699 
Ser Leu Ser Leu Val Asp Pro Ser Gly Asn Val Tyr Glu Asp Val Ser 
520 525 530 
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tgg aat aac cct caa gtc ttt tct tgt etc act ctt act get gac gac 
Trp Asn Asn Pro Gin Val.Phe Ser Cys Leu Thr Leu Thr Ala Asp Asp 
535 540 545 



1747 



ccc gcg aat att cac ate aca gac tta get get gat ccc eta gaa aaa 
Pro Ala Asn lie His lie Thr Asp Leu Ala Ala Asd Pro Leu Glu Lys 
550 555 560 " 565 



1795 



aat cct ate cat tgg gga tac caa ggg aat tgg gca tta tct tgg caa 
Asn Pro lie His Trp Gly Tyr Gin Gly Asn Trp Ala Leu Ser Trp Gin 
570 575 580 



1843 



gag gat act gcg act aaa tec aaa gca gcg act ctt acc tgg aca aaa 
Glu Asp Thr Ala Thr Lys Ser Lys Ala Ala Thr Leu Thr Trp Thr Lys 
585 590 595 



1891 



aca gga tac aat ccg aat cct gag cgt cgt gga acc tta gtt get aac 
Thr Gly Tyr Asn Pro Asn Pro Glu Arg Arg Gly Thr Leu Val Ala Asn 
600 605 610 



1939 



acg eta tgg gga tec ttt gtt gat gtg cgc tec ata caa cag ctt gta 
Thr Leu Trp Gly Ser Phe Val Asp Val Arg Ser lie Gin Gin Leu Val 
615 620 625 



1987 



gec act aaa gta cgc caa tct caa gaa act cgc ggc ate tgg tgt gaa 
Ala Thr Lys Val Arg Gin Ser Gin Glu Thr Arg Gly lie Trp Cys Glu 
630 635 640 645 



2035 



ggg ate teg aac ttc ttc cat aaa gat age acg aag ata aat aaa ggt 
Gly lie Ser Asn Phe Phe His Lys Asp Ser Thr Lys lie Asn Lys Gly 
650 655 660 



2083 



ttt cgc cac ata agt gca ggt tat gtt gta gga gcg act aca aca tta 
Phe Arg His lie Ser Ala Gly Tyr Val Val Gly Ala Thr Thr Thr Leu 
665 670 675 



2131 



get tct gat aat ctt ate act gca gec ttc tgc caa tta ttc ggg aaa 
Ala Ser Asp Asn Leu lie Thr Ala Ala Phe Cys Gin Leu Phe Gly Lys 
680 685 690 



2179 



gat aga gat cac ttt ata aat aaa aat aga get tct gee tat gca get 
Asp Arg Asp His Phe lie Asn Lys Asn Arg Ala Ser Ala Tyr Ala Ala 
695 700 705 



2227 



tct etc cat etc cag cat eta gcg acc ttg tct tct cca age ttg tta 
Ser Leu His Leu Gin His Leu Ala Thr Leu Ser Ser Pro Ser Leu Leu 
710 715 720 725 



2275 



cgc tac ctt cct gga tct gaa agt gag cag cct gtc etc ttt gat get 
Arg Tyr Leu Pro Gly Ser Glu Ser Glu Gin Pro Val Leu Phe Asp Ala 
730 735 740 



2323 



cag ate age tat ate tat agt aaa aat act atg aaa acc tat tac acc 
Gin lie Ser Tyr lie Tyr Ser Lys Asn Thr Met Lys Thr Tyr Tyr Thr 
745 750 755 



2371 
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caa gca cca aag gga gag age teg tgg tat. aat gac ggt tge-get ctg 2*419 
Gin Ala Pro Lys Gly Glu Ser Ser Trp Tyr Asn Asp Gly Cys Ala Leu 
760 765 770 

gaa ctt gcg age tec eta cca cac act get tta age cat gag ggt etc 24 67 
Glu Leu Ala Ser Ser Leu Pro His Thr Ala Leu Ser His Glu Gly Leu 
775 780 785 

ttc cac gcg tat ttt cct ttc ate aaa gta gaa get teg tac ata cac 2515 
Phe His Ala Tyr Phe Pro Phe lie Lys Val Glu Ala Ser Tyr lie His 
790 795 800 805 

caa gat age ttc aaa gaa cgt aat act ace ttg gta cga tct ttc gat 2563 
Gin Asp Ser Phe Lys Glu Arg Asn Thr Thr Leu Val Arg Ser Phe Asp 
810 815 820 

age ggt gat tta att aac gtc tct gtg cct att gga att ace ttc gag 2611 
Ser Gly Asp Leu He Asn Val Ser Val Pro He Gly He Thr Phe Glu 
825 830 835 

aga ttc teg aga aac gag cgt gcg tct tac gaa get act gtc ate tac 2659 
Arg Phe Ser Arg Asn Glu Arg Ala Ser Tyr Glu Ala Thr Val He Tyr 
840 845 850 

gtt gee gat gtc tat cgt aag aat cct gac tgc acg aca get etc eta 2707 
Val Ala Asp Val Tyr Arg Lys Asn Pro Asp Cys Thr Thr Ala Leu Leu 
855 860 865 

ate aac aat acc teg tgg aaa act aca gga acg aat etc tea aga caa 2755 
He Asn Asn Thr Ser Trp Lys Thr Thr Gly Thr Asn Leu Ser Arg Gin 
870 875 880 885 

get ggt ate gga aga gca ggg ate ttt tat gec ttc tct cca aat ctt 2803 
Ala Gly He Gly Arg Ala Gly He Phe Tyr Ala Phe Ser Pro Asn Leu 
890 895 900 

gag gtc aca agt aac eta tct atg gaa att cgt gga tct tea cgc age 2851 
Glu Val Thr Ser Asn Leu Ser Met Glu He Arg Gly Ser Ser Arg Ser 
905 910 915 

tac aat gca gat ctt gga ggt aag ttc cag ttc taaaagcgtt cctgatccct 2904 
Tyr Asn Ala Asp Leu Gly Gly Lys Phe Gin Phe 
920 925 

tagaaattct aagagatcct gagtgtatct agggacttct caaaga 2950 



<210>6 

<211>2784 

<212>DNA 

<2 13>Chlamydiapneumoniae 

<220> 

<221>CDS 

<222>(1) . . (2784) 

<400>6 
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atgaaatcct ctcttcattg gtttttaatc tcgtcatctt tagcacttcc cttgtcacta 60 

aatttctctg cgtttgctgc tgttgttgaa atcaatctag gacctaccaa tagcttctct 120 

ggaccaggaa cctacactcc tccagcccaa acaacaaatg caoatggaac tatctataat 180 

ctaacagggg atgtctcaat caccaatgca ggatctccga cagctctaac cgcttcctgc 240 

tttaaagaaa ctactgggaa tctttctttc caaggccacg gctaccaatt tctcctacaa 300 

aatatcgatg cgggagcgaa ctgtaccttt. accaatacag ctgcaaataa gcttctctcc 360 

ttttcaggat tctcctattt gtcactaata caaaccacga atgctaccac aggaacagga 420 

gccatcaagt ccacaggagc ttgttctatt cagtcgaact atagttgcta ctttggccaa 480 

aacttttcta atgacaatgg aggcgccctc caaggcagct ctatcagtct atcgctaaac 540 

cccaacctaa cgtttgccaa aaacaaagca acgcaaaaag ggggtgccct ctattccacg 600 

ggagggatta caattaacaa tacgttaaac tcagcatcat tttctgaaaa taccgcggcg 660 

aacaatggcg gagccattta cacggaagct agcagtttta ttagcagcaa caaagcaatt 720 

agctttataa acaatagtgt gaccgcaacc tcagctacag ggggagccat ttactgtagt 780 

agtacatcag cccccaaacc agtcttaact ctatcagaca acggggaact gaactttata 840 

ggaaatacag caattactag tggtggggcg atttatactg acaatctagt tctttcttct 900 

ggaggaccta cgctttttaa aaacaactct ggctatgata ctgcagctcc cttaggagga 960 

gcaattgcga ttgctgactc tggatctttg agtctttcgg ctcttggtgg agacatcact 1020 

tttgaaggaa acacagtagt caaaggagct tcttcgagtc agaccactac cagaaattct 1080 

attaacatcg gaaacaccaa tgctaagatt gtacagctgc gagcctctca aggcaatact 1140 

atctacttct atgatcctat aacaactagc atcactgcag ctctctcaga tgctctaaac 1200 

ttaaatggtc ctgaccttgc agggaatcct gcatatcaag gaaccatcgt attttctgga 1260 

gagaagctct cggaagcaga agctgcagaa gctgataatc tcaaatctac aattcagcaa 1320 

cctctaactc ttgcgggagg gcaactctct cttaaatcag gagtcactct agttgctaag 1380 

tccttttcgc aatctccggg ctctaccctc ctcatggatg cagggaccac attagaaacc 1440 

gctgatggga tcactatcaa taatcttgtt ctcaatgtag attccttaaa agagaccaag 1500 

aagggcacgc taaaagcaac acaagcaagt cagacagtca ctttatctgg atcgctctct 1560 

cttgtagatc cttctggaaa tgtctacgaa gatgtctctt ggaataaccc tcaagtcttt 1620 

tcttgtctca ctcttactgc tgacgacccc gcgaatattc acatcacaga cttagctgct 1680 

gatcccctag aaaaaaatcc tatccattgg ggataccaag ggaattgggc attatcttgg 1740 

caagaggata ctgcgactaa atccaaagca gcgactctta cctggacaaa aacaggatac 1800 

aatccgaatc ctgagcgtcg tggaacctta gttgctaaca cgctatgggg atcctttgtt 1860 

gatgtgcgct ccatacaaca gcttgtagcc actaaagtac gccaatctca agaaactcgc 1920 

ggcatctggt gtgaagggat ctcgaacttc ttccataaag atagcacgaa gataaataaa 1980 

ggttttcgcc acataagtgc aggttatgtt gtaggagcga ctacaacatt agcttctgat 2040 

aatcttatca ctgcagcctt ctgccaatta ttcgggaaag atagagatca ctttataaat 2100 

aaaaatagag cttctgccta tgcagcttct ctccatctcc agcatctagc gaccttgtct 2160 

tctccaagct tgttacgcta ccttcctgga tctgaaagtg agcagcctgt cctctttgat 2220 

gctcagatca gctatatcta tagtaaaaat actatgaaaa cctattacac ccaagcacca 2280 

aagggagaga gctcgtggta taatgacggt tgcgctctgg aacttgcgag ctccctacca 234 0 

cacactgctt taagccatga gggtctcttc cacgcgtatt ttcctttcat caaagtagaa 2400 

gcttcgtaca tacaccaaga tagcttcaaa gaacgtaata ctaccttggt acgatctttc 24 60 

gatagcggtg atttaattaa cgtctctgtg cctattggaa ttaccttcga gagattctcg 2520 

agaaacgagc gtgcgtctta cgaagctact gtcatctacg ttgccgatgt ctatcgtaag 2580 

aatcctgact gcacgacagc tctcctaatc aacaatacct cgtggaaaac tacaggaacg 264 0 

aatctctcaa gacaagctgg tatcggaaga gcagggatct tttatgcctt ctctccaaat 2700 

cttgaggtca caagtaacct atctatggaa attcgtggat cttcacgcag ctacaatgca 2760 

gatcttggag gtaagttcca gttc 2784 
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<210> 7 
<211> 3000 
<212> DNA 

<213> Chlamydia pneumoniae 

<220> 

<221> CDS 

<222> (101) . . (2890) 
<400> 7 

gtacgaagtt cttcacgaaa ttataataca aacctaggct ctaagttttg tttctagatt 60 

atcgaaaacg tgttaattaa ttgaacccaa gcatctttct atg aaa ata ccc ttg 115 

Met Lys lie Pro Leu 
1 5 



cac aaa etc ctg ate tct teg act ctt. gtc act ccc att eta ttg age 
His Lys Leu Leu lie Ser Ser Thr Leu Val Thr Pro lie Leu Leu Ser 
10 15 20 



163 



att gca act tac gga gca gat get tct tta tec cct aca gat age ttt 
lie Ala Thr Tyr Gly Ala Asp Ala Ser Leu Ser Pro Thr Asp Ser Phe 
25 30 35 



211 



gat gga gcg ggc ggc tct aca ttt act cca aaa tct aca gca gat gee 
Asp Gly Ala Gly Gly Ser Thr Phe Thr Pro Lys Ser Thr Ala Asp Ala 
40 45 50 



259 



aat gga acg aac tat gtc tta tea gga aat gtc tat ata aac gat get 
Asn Gly Thr Asn Tyr Val Leu Ser Gly Asn Val Tyr lie Asn Asp Ala 
55 60 65 



307 



ggg aaa ggc aca gca tta aca ggc tgc tgc ttt aca gaa act acg ggt 
Gly Lys Gly Thr Ala Leu Thr Gly Cys Cys Phe Thr Glu Thr Thr Gly 
70 75 80 85 



355 



gat ctg aca ttt act gga aag gga tac tea ttt tea ttc aac acg gta 
Asp Leu Thr Phe Thr Gly Lys Gly Tyr Ser Phe Ser Phe Asn Thr Val 
90 95 100 



403 



gat gcg ggt teg aat gca gga get gcg gca age aca act get gat aaa 4 51 

Asp Ala Gly Ser Asn Ala Gly Ala Ala Ala Ser Thr Thr Ala Asp Lys 

105 110 115 

gec eta ate ttc aca gga ttt tct aac ctt tec ttc att gca get cct 4 99 

Ala Leu lie Phe Thr Gly Phe Ser Asn Leu Ser Phe He Ala Ala Pro 

120 125 130 



gga act aca gtt get tea gga aaa agt act tta agt tct gca gga gee 547 
Gly Thr Thr Val Ala Ser Gly Lys Ser Thr Leu Ser Ser Ala Gly Ala 
135 140 145 

tta aat ctt acc gat aat gga acg att etc ttt age caa aac gtc tec 595 
Leu Asn Leu Thr Asp Asn Gly Thr He Leu Phe Ser Gin Asn Val Ser 
150 155 160 165 
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aat gaa get aat aac aat ggc gga gcg ate acc aca aaa act ctt tct 64 3 
Asn Glu Ala Asn Asn Asn Gly Gly Ala lie Thr Thr Lys Thr Leu Ser 
170 175 180 

att tct ggg aat acc tct tct ata acc ttc act age aat age gca aaa 691 
lie Ser Gly Asn Thr Ser Ser lie Thr Phe Thr Ser Asn Ser Ala Lys 
185 190 195 

aaa tta ggt gga gcg ate tat age tct gcg get gca agt att tea gga 739 
Lys Leu Gly Gly Ala lie Tyr Ser Ser Ala Ala Ala Ser lie Ser Gly 
200 205 210 

aac acc ggc cag tta gtc ttt atg aat aat aaa gga gaa act ggg ggt 787 
Asn Thr Gly Gin Leu Val Phe Met Asn Asn Lys Gly Glu Thr Gly Gly 
215 220 225 

ggg get ctg ggc ttt gaa gec age tec teg att act caa aat age tec 835 
Gly Ala Leu Gly Phe Glu Ala Ser Ser Ser lie Thr Gin Asn Ser Ser 
230 235 240 245 

ctt ttc ttc tct gga aac act gca aca gat get gca ggc aag ggc ggg 883 
Leu Phe Phe Ser Gly Asn Thr Ala Thr Asp Ala Ala Gly Lys Gly Gly 
250 255 260 

gec att tat tgt gaa aaa aca gga gag act cct act ctt act ate tct 931 
Ala lie Tyr Cys Glu Lys Thr Gly Glu Thr Pro Thr Leu Thr lie Ser 
265 270 275 

gga aat aaa agt ctg acc ttc gee gag aac tct tea gta act caa ggc 97 9 
Gly Asn Lys Ser Leu Thr Phe Ala Glu Asn Ser Ser Val Thr Gin Gly 
280 285 290 

gga gca ate tgt gec cat ggt eta gat ctt tec get get ggc cct acc 1027 
Gly Ala lie Cys Ala His Gly Leu Asp Leu Ser Ala Ala Gly Pro Thr 
295 300 305 

eta ttt tea aat aat aga tgc ggg aac aca get gca ggc aag ggc ggc 1075 
Leu Phe Ser Asn Asn Arg Cys Gly Asn Thr Ala Ala Gly Lys Gly Gly 
310 315 320 325 

get att gca att gec gac tct gga tct tta agt etc tct gca aat caa 1123 
Ala lie Ala lie Ala Asp Ser Gly Ser Leu Ser Leu Ser Ala Asn Gin 
330 335 340 

gga gac ate acg ttc ctt ggc aac act eta acc tea acc tec gcg cca 1171 
Gly Asp lie Thr Phe Leu Gly Asn Thr Leu Thr Ser Thr Ser Ala Pro 
345 350 355 

aca teg aca egg aat get ate tac ctg gga teg tea gca aaa att acg 1219 
Thr Ser Thr Arg Asn Ala lie Tyr Leu Gly Ser Ser Ala Lys lie Thr 
360 365 370 

aac tta agg gca gec caa ggc caa tct ate tat ttc tat gat ccg att 12 67 
Asn Leu Arg Ala Ala Gin Gly Gin Ser lie Tyr Phe Tyr Asp Pro lie 
375 380 385 
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gca tct aac acc 
Ala Ser Asn Thr 
390 
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aca gga get tea gac gtt ctg acc ate aac caa 
Thr Gly Ala Ser Asp Val Leu Thr lie Asn Gin 
395 400 



8 5 ft 28 
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ccg 
Pro 
405 



1315 



gat age aac teg 
Asp Ser Asn Ser 



cct tta gat tat tea gga acg att gta ttt tct ggg 
Pro Leu Asp Tyr Ser Gly Thr lie Val Phe Ser Gly 
410 415 420 



1363 



gaa aag etc tct 
Glu Lys Leu Ser 
425 



gca gat gaa gcg aaa get get gat aac ttc aca tct 
Ala Asp Glu Ala Lys Ala Ala Asp Asn Phe Thr Ser 
430 435 



1411 



ata tta aag caa 
lie Leu Lys Gin 
440 



cca ttg get eta gec tct gga acc tta gca etc aaa 
Pro Leu Ala Leu Ala Ser Gly Thr Leu Ala Leu Lys 
445 450 



1459 



gga aat gtc gag 
Gly Asn Val Glu 
455 



tta gat gtc aat ggt ttc aca cag act gaa ggc tct 
Leu Asp Val Asn Gly Phe Thr Gin Thr Glu Gly Ser 
460 465 



1507 



aca etc etc atg 
Thr Leu Leu Met 
470 



caa cca gga aca aag etc aaa gca gat act gaa get 
Gin Pro Gly Thr Lys Leu Lys Ala Asp Thr Glu Ala 
475 480 485 



1555 



ate agt ctt acc 
lie Ser Leu Thr 



aaa ctt gtc gtt gat ctt tct gec tta gag gga aat 
Lys Leu Val Val Asp Leu Ser Ala Leu Glu Gly Asn 
490 495 500 



1603 



aag agt gtg tec 
Lys Ser Val Ser 
505 



att gaa aca gca gga gec aac aaa act ata act eta 
lie Glu Thr Ala Gly Ala Asn Lys Thr lie Thr Leu 
510 515 



1651 



acc tct cct ctt 
Thr Ser Pro Leu 
520 



gtt ttc caa gat agt age ggc aat ttt tat gaa age 
Val Phe Gin Asp Ser Ser Gly Asn Phe Tyr Glu Ser 
525 530 



1699 



cat acg ata aac 
His Thr He Asn 
535 



caa gec ttc acg cag cct ttg gtg gta ttc act get 
Gin Ala Phe Thr Gin Pro Leu Val Val Phe Thr Ala 
540 545 



1747 



get act get get 
Ala Thr Ala Ala 
550 

gta caa act cca 
Val Gin Thr Pro 



age gat att tat ate gat gcg ctt etc act tct cca 17 95 

Ser Asp He Tyr He Asp Ala Leu Leu Thr Ser Pro 
555 560 565 

gaa cct cat tac ggg tat cag gga cat tgg gaa gee 1843 

Glu Pro His Tyr Gly Tyr Gin Gly His Trp Glu Ala 

570 575 580 



act tgg gca gac 
Thr Trp Ala Asp 
585 



aca tea act gca aaa tea gga act atg act tgg gta 
Thr Ser Thr Ala Lys Ser Gly Thr Met Thr Trp Val 
590 595 



1891 



act acg ggc tac 
Thr Thr Gly Tyr 
600 



aac cct aat cct gag cgt aga get tec gta gtt ccc 
Asn Pro Asn Pro Glu Arg Arg Ala Ser Val Val Pro 
605 610 



1939 
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gat tea tta tgg gca tec ttt act gac att cgc act eta cag cag ate 1987 
Asp Ser Leu Trp Ala Ser Phe Thr Asp lie Arg Thr Leu Gin Gin He 
615 620 625 

atg aca tct caa gcg aat agt ate tat cag caa cga gga etc tgg gca 2035 
Met Thr Ser Gin Ala Asn Ser He Tyr Gin Gin Arg Gly Leu Trp Ala 
630 635 640 645 

tea gga act gcg aat ttc ttc cat aag gat aaa tea gga act aac caa 2083 
Ser Gly Thr Ala Asn Phe Phe His Lys Asp Lys Ser Gly Thr Asn Gin 
650 655 660 

gca ttc cga cat aaa age tac ggc tat att gtt gga gga agt get gaa 2131 
Ala Phe Arg His Lys Ser Tyr Gly Tyr He Val Gly Gly Ser Ala Glu 
665 670 675 

gat ttt tct gaa aat ate ttc agt gta get ttc tgc cag etc ttc ggt 2179 
Asp Phe Ser Glu Asn lie Phe Ser Val Ala Phe Cys Gin Leu Phe Gly 
680 685 690 

aaa gat aaa gac ctg ttt ata gtt gaa aat ace tct cat aac tat tta 2227 
Lys Asp Lys Asp Leu Phe He Val Glu Asn Thr Ser His Asn Tyr Leu 
695 700 705 

gcg teg eta tac ctg caa cat cga gca ttc eta gga gga ctt ccc atg 2275 
Ala Ser Leu Tyr Leu Gin His Arg Ala Phe Leu Gly Gly Leu Pro Met 
710 715 720 725 

ccc tea ttt gga agt ate acc gac atg ctg aaa gat att cct etc att 2323 
Pro Ser Phe Gly Ser He Thr Asp Met Leu, Lys Asp He Pro Leu He 
730 735 740 

ttg aat gec cag eta age tac age tac act aaa aat gat atg gat act 2371 
Leu Asn Ala Gin Leu Ser Tyr Ser Tyr Thr Lys Asn Asp Met Asp Thr 
745 750 755 

cgc tat act tec tat cct gaa get caa ggc tct tgg acc aat aac tct 2419 
Arg Tyr Thr Ser Tyr Pro Glu Ala Gin Gly Ser Trp Thr Asn Asn Ser 
760 765 770* 

ggg get eta gag etc gga gga tct ctg get eta tat etc cct aaa gaa 24 67 
Gly Ala Leu Glu Leu Gly Gly Ser Leu Ala Leu Tyr Leu Pro Lys Glu 
775 780 785 

gca ccg ttc ttc cag gga tat ttc ccc ttc tta aag ttc cag gca gtc 2515 
Ala Pro Phe Phe Gin Gly Tyr Phe Pro Phe Leu Lys Phe Gin Ala Val 
790 795 800 805 

tac age cgc caa caa aac ttt aaa gag agt ggc get gaa gec cgt get 2563 
Tyr Ser Arg Gin Gin Asn Phe Lys Glu Ser Gly Ala Glu Ala Arg Ala 
810 815 820 

ttt gat gat gga gac eta gtg aac tgc tct ate cct gtc ggc att egg 2611 
Phe Asp Asp Gly Asp Leu Val Asn Cys Ser He Pro Val Gly He Arg 
825 830 835 
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tta gaa aaa ate tec gaa gat gaa aaa aat aat ttc gag att tct eta 
Leu Glu Lys lie Ser Glu Asp Glu Lys Asn Asn Phe Glu lie Ser Leu 
840 845 850 



2659 



gee tac att ggt gat gtg tat cgt aaa aat ccc cgt teg cgt act tct 2707 
Ala Tyr lie Gly Asp Val Tyr Arg Lys Asn Pro Arg Ser Arg Thr Ser 
855 860 865 

eta atg gtc agt gga gec tct tgg act teg eta tgt aaa aac etc gca 2755 
Leu Met Val Ser Gly Ala Ser Trp Thr Ser Leu Cys Lys Asn Leu Ala 
870 875 880 885 

cga caa gec ttc tta gca agt get gga age cat ctg act etc tec cct 2803 
Arg Gin Ala Phe Leu Ala Ser Ala Gly Ser His Leu Thr Leu Ser Pro 
890 895 900 

cat gta gaa etc tct ggg gaa get get tat gag ctt cgt ggc tea gca 2851 
His Val Glu Leu Ser Gly Glu Ala Ala Tyr Glu Leu Arg Gly Ser Ala 
905 910 915 

cac ate tac aat gta gat tgt ggg eta aga tac tea ttc tagttcctac 2900 
His lie Tyr Asn Val Asp Cys Gly Leu Arg Tyr Ser Phe 
920 925 930 

tttcctccct aaacttttag ggaggaattc ttataaaaac cctgtagatt cttaacttac 2960 

tagtctctcc tttcctcttg ctttctttaa tttattgcag 3000 



<210>8 

<211>2790 

<212>DNA 

<213>Chlamydiapneumoniae 

<220> 

<221>CDS 

<222>(1) . . (2790) 

<400>8 



atgaaaatac 
ageattgeaa 
ggeggctcta 
tcaggaaatg 
acagaaaeta 
gtagatgegg 
ttcacaggat 
aaaagtactt 
agecaaaacg 
tctatttctg 
ggagegatet 
atgaataata 
actcaaaata 
ggggccattt 
agtctgacct 
ctagatcttt 
gcaggcaagg 



ccttgcacaa 
ettaeggage 
catttactcc 
tctatataaa 
egggtgatet 
gttcgaatgc 
tttctaacct 
taagttctgc 
tctccaatga 
ggaatacctc 
atagctctgc 
aaggagaaac 
gctccctttt 
attgtgaaaa 
tegecgagaa 
ccgctgctgg 
geggegctat 



actcctgatc 
agatgettet 
aaaatctaca 
cgatgctggg 
gacatttact 
aggagctgeg 
ttccttcatt 
aggagectta 
agctaataac 
ttctataacc 
ggctgcaagt 
tgggggtggg 
cttctctgga 
aacaggagag 
ctcttcagta 
ccctacccta 
tgeaattgee 



tcttcgactc 
ttatccccta 
geagatgeca 
aaaggcacag 
ggaaagggat 
gcaagcacaa 
gcagctcctg 
aatcttaccg 
aatggcggag 
ttcactagta 
atttcaggaa 
gctctgggct 
aacactgeaa 
actcctactc 
actcaaggcg 
ttttcaaata 
gactctggat 



ttgtcactcc 
cagatagctt 
atggaacgaa 
cattaacagg 
actcattttc 
ctgctgataa 
gaactacagt 
ataatggaac 
cgatcaccac 
atagegcaaa 
acaccggcca 
ttgaagccag 
cagatgetge 
ttactatctc 
gagcaatctg 
atagatgegg 
ctttaagtct 



cattctattg 
tgatggagcg 
ctatgtctta 
ctgctgcttt 
attcaacacg 
agccctaatc 
tgcttcagga 
gattctcttt 
aaaaactctt 
aaaattaggt 
gttagtcttt 
ctcctcgatt 
aggcaagggc 
tggaaataaa 
tgcccatggt 
gaacacagct 
etctgeaaat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
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caaggagaca 
cggaatgcta 
caatctatct 
accatcaacc 
ggggaaaagc 
caaccattgg 
aatggtttca 
gcagatactg 
aataagagtg 
cttgttttcc 
acgcagcctt 
ttctcacttc 
ccacttgggc 
acaaccctaa 
ctgacattcg 
gaggactctg 
aagcattccg 
aaaatatctt 
ttgaaaatac 
gaggacttcc 
ttttgaatgc 
cctatcctga 
ctctggctct 
agttccaggc 
cttttgatga 
tctccgaaga 
gtaaaaatcc 
tagtaaaaac 
cctcatgtag 
aatgtagatt 



tcacgttcct 
tctacctggg 
atttctatga 
aaccggatag 
tctctgcaga 
ctctagcctc 
cacagactga 
aagctatcag 
tgtccattga 
aagatagtag 
tggtggtatt 
tccagtacaa 
agacacatca 
tcctgagcgt 
cactctacag 
ggcatcagga 
acataaaagc 
cagtgtagct 
ctctcataac 
catgccctca 
ccagctaagc 
agctcaaggc 
atatctccct 
agtctacagc 
tggagaccta 
tgaaaaaaat 
ccgttcgcgt 
ctcgcacgac 
aactctctgg 
gtgggctaag 



tggcaacact 
atcgtcagca 
tccgattgca 
caactcgcct 
tgaagcgaaa 
tggaacctta 
aggctctaca 
tcttaccaaa 
aacagcagga 
cggcaatttt 
actgctgcta 
actccagaac 
actgcaaaat 
agagcttccg 
cagatcatga 
actgcgaatt 
tacggctata 
ttctgccagc 
tatttagcgt 
tttggaagta 
tacagctaca 
tcttggacca 
aaagaagcac 
cgccaacaaa 
gtgaactgct 
aatttcgaga 
acttctctaa 
aagccttctt 
ggaagctgct 
atactcattc 



ctaacctcaa 
aaaattacga 
tctaacacca 
ttagattatt 
gctgctgata 
gcactcaaag 
ctcctcatgc 
cttgtcgttg 
gccaacaaaa 
tatgaaagcc 
ctgctgctag 
ctcattacgg 
caggaactat 
tagttcccga 
catctcaagc 
tcttccataa 
ttgttggagg 
tcttcggtaa 
cgctatacct 
tcaccgacat 
ctaaaaatga 
ataactctgg 
cgttcttcca 
actttaaaga 
ctatccctgt 
tttctctagc 
tggtctcagt 
agcaagtgct 
tatgagcttc 



cctccgcgcc 
acrtaagggc 
caggagcttc 
caggaacgat 
acttcacatc 
gaaatgtcga 
aaccaggaac 
atctttctgc 
ctataactct 
atacgataaa 
cgatatttat 
gtarcaggga 
gacttgggta 
ttcattatgg 
gaatagtatc 
ggataaatca 
aagtgctgaa 
agataaagac 
gcaacatcga 
gctgaaagat 
tatggatact 
ggctctagag 
gggatatttc 
gagtggcgct 
cggcattcgg 
ctacattggt 
ggagcctctt 
ggaagccatc 
gtggctcagc 



aacatcgaca 
agcccaaggc 
agacgttctg 
tgtattttct 
tatattaaag 
gttagatgtc 
aaagctcaaa 
cttagaggga 
aacctctcct 
ccaagccttc 
atcgatgcgc 
cattgggaag 
actacgggct 
gcatccttta 
tatcagcaac 
ggaactaacc 
gatttttctg 
ctgtttatag 
gcattcctag 
attcctctca 
cgctatactt 
ctcggaggat 
cccttcttaa 
gaagcccgtg 
ttagaaaaaa 
gatgtgtatc 
ggacttcgct 
tgactctctc 
acacatctac 



1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2790 



<210> 9 

<211> 1100 

<212> DNA 

<213> Chlamydia pneumoniae 

<220> 

<221> CDS 

<222> (101) . . (979) 

<400> 9 

ataaagtttt ttatatgaac aaaactttga cttcattggc agacacttcg accttaacag 60 

accaattttg ttgtcatccc tataaaaatc aggaattttc atg etc tec tea eta 115 

Met Leu Ser Ser Leu 
1 5 

ate cgt gat tea ttt ccc ctt ctt att tta ctt ccc aca ttc eta gcg 163 
lie Arg Asp Ser Phe Pro Leu Leu lie Leu Leu Pro Thr Phe Leu Ala 
10 15 20 

gca tta gga gee tec gta get ggc ggc gtt atg gga ace tat ate gtt 211 
Ala Leu Gly Ala Ser Val Ala Gly Gly Val Met Gly Thr Tyr lie Val 
25 30 35 
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gta aaa cgt att gtt tea att agt gga agt ata tct cat gca att eta 259 
Val Lys Arg lie Val Ser lie Ser Gly Ser lie Ser His Ala lie Leu 
40 45 50 

gga gga att ggc etc ace eta tgg ata caa tat aag ctt cat etc tct 307 
Gly Gly lie Gly Leu Thr Leu Trp lie Gin Tyr Lys Leu His Leu Ser 
55 60 65 

ttt ttc cct atg tat gga get att gta gga get att ttt eta get ctt 355 
Phe Phe Pro Met Tyr Gly Ala lie Val Gly Ala lie Phe Leu Ala Leu 
70 75 80 85 

tgc ate ggc aag ate cac ctg aaa tac caa gaa agg gaa gac tct ttg 403 
Cys lie Gly Lys He His Leu Lys Tyr Gin Glu Arg Glu Asp Ser Leu 
90 95 100 

att gcg atg att tgg tct gtg ggc atg gca att gga att ata ttc att 451 
He Ala Met He Trp Ser Val Gly Met Ala He Gly He He Phe He 
105 110 115 

tec agg ctt ccc acc ttt aat gga gag etc ate aat ttt eta ttt ggg 499 
Ser Arg Leu Pro Thr Phe Asn Gly Glu Leu He Asn Phe Leu Phe Gly 
120 125 130 

aac att etc tgg gtc acc cct tea gac etc tat age tta gga ate ttt 547 
Asn He Leu Trp Val Thr Pro Ser Asp Leu Tyr Ser Leu Gly He Phe 
135 140 145 

gat ctt ctt gtt tta gga att gtg gtc ctt tgc cac acc egg ttc ctt 595 
Asp Leu Leu Val Leu Gly He Val Val Leu Cys His Thr Arg Phe Leu 
150 155 160 165 

get ctt tgc ttt gat gag agg tac acg get tta aac cat tgt tct gta 643 
Ala Leu Cys Phe Asp Glu Arg Tyr Thr Ala Leu Asn His Cys Ser Val 
170 175 180 

cag ctg tgg tat ttc eta ctt ctt gtt ctg aca gca ate acg att gtg 691 
Gin Leu Trp Tyr Phe Leu Leu Leu Val Leu Thr Ala He Thr He Val 
185 190 195 

atg ttg att tat gtg atg gga acg att ctg atg ctt age atg etc gtc 739 
Met Leu He Tyr Val Met Gly Thr lie Leu Met Leu Ser Met Leu Val 
200 205 210 

tta cct gtt get ata gcg tgt aga ttt teg tac aag atg aca cga att 787 
Leu Pro Val Ala He Ala Cys Arg Phe Ser Tyr Lys Met Thr Arg He 
215 220 225 

atg ttc ate teg gtc etc ttg aat ate tta tgt tct ttt tct gga att 835 
Met Phe He Ser Val Leu Leu Asn He Leu Cys Ser Phe Ser Gly He 
230 235 240 245 

tgc ate gee tac tgt tta gat ttc cca gta ggt cct acg ata tea ttg 883 
Cys He Ala Tyr Cys Leu Asp Phe Pro Val Gly Pro Thr He Ser Leu 
250 255 260 
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ctg atg ggg tta ggt tat aca gcg agt ctt tgt gtg aag aag egg tac 931 
Leu Met Gly Leu Gly Tyr Thr Ala Ser Leu Cys Val Lys Lys Arg Tyr 
265 270 275 

aat ccg teg acg cct tct cct gta agt cct gaa ate aat aca aat gta 979 
Asn Pro Ser Thr Pro Ser Pro Val Ser Pro Glu lie Asn Thr Asn Val 
280 285 290 

tagctaggga agegcttttg gaagctttgg aggcattctt cctgttcgtc aggaagaaga 1039 

tcatcaattt tatttaaagc taccagcata tctttctttt caaaatctgg ctgatgagag 1099 

t 1100 



<210> 10 
<211> 880 
<212> DNA 

<213> Chlamydiapneumoniae 

<220> 

<221> CDS 

<222> (1) . . (880) 

<400> 10 



atgctctcct 
geggcattag 
attgtttcaa 
tggatacaat 
atttttctag 
ttgattgega 
cccaccttta 
tcagacctct 
cacacccggt 
gtacagctgt 
tatgtgatgg 
agattttcgt 
tctttttctg 
ttgctgatgg 
gacgccttct 



cactaatccg 
gagcctccgt 
ttagtggaag 
ataagcttca 
etctttgeat 
tgatttggtc 
atggagagct 
atagcttagg 
tccttgctct 
ggtatttcct 
gaacgattct 
acaagatgae 
gaatttgeat 
ggttaggtta 
cctgtaagtc 



tgattcattt 
agetggegge 
tatatctcat 
tctctctttt 
eggcaagate 
tgtgggcatg 
catcaatttt 
aatctttgat 
ttgetttgat 
acttcttgtt 
gatgettage 
acgaattatg 
cgcctactgt 
tacagegagt 
ctgaaatcaa 



ccccttctta 
gttatgggaa 
gcaattctag 
ttccctatgt 
cacctgaaat 
gcaattggaa 
ctatttggga 
cttcttgttt 
gagaggtaca 
ctgacagcaa 
atgetegtet 
ttcatctegg 
ttagatttcc 
ctttgtgtag 
tacaaatgta 



ttttacttcc 
ectatategt 
gaggaattgg 
atggagctat 
aceaagaaag 
ttatattcat 
acattctctg 
taggaattgt 
eggctttaaa 
tcacgattgt 
tacctgttgc 
tcctcttgaa 
cagtaggt cc 
aagaagcggt 



cacattccta 
tgtaaaacgt 
cctcacccta 
tgtaggagct 
ggaagactct 
ttccaggctt 
ggtcacccct 
ggtcctttgc 
ccattgttct 
gatgttgatt 
tatagcgtgt 
tatcttatgt 
tacgatatca 
acaatccgtc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
880 



<210> 11 
<211> 928 
<212> PRT 

<213> Chlamydia pneumoniae 
<400> 11 

Met Lys Thr Ser lie Pro Trp Val Leu Val Ser Ser Val Leu Ala Phe 
1 5 10 15 

Ser Cys His Leu Gin Ser Leu Ala Asn Glu Glu Leu Leu Ser Pro Asp 
20 25 30 
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Asp Ser Phe Asn Gly Asn lie Asp Ser Gly Thr Phe Thr Pro Lys Thr 
35 40 45 

Ser Ala Thr Thr Tyr Ser Leu Thr Gly Asp Val Phe Phe Tyr Glu Pro 
50 55 60 

Gly Lys Gly Thr Pro Leu Ser Asp Ser Cys Phe Lys Gin Thr Thr Asp 
65 70 75 80 

Asn Leu Thr Phe Leu Gly Asn Gly His Ser Leu Thr Phe Gly Phe lie 
85 90 95 

Asp Ala Gly Thr His Ala Gly Ala Ala Ala Ser Thr Thr Ala Asn Lys 
100 105 110 

Asn Leu Thr Phe Ser Gly Phe Ser Leu .Leu Ser Phe Asp Ser Ser Pro 
115 120 125 

Ser Thr Thr Val Thr Thr Gly Gin Gly Thr Leu Ser Ser Ala Gly Gly 
130 135 140 

Val Asn Leu Glu Asn lie Arg Lys Leu Val Val Ala Gly Asn Phe Ser 
145 150 155 160 

Thr Ala Asp Gly Gly Ala lie Lys Gly Ala Ser Phe Leu Leu Thr Gly 
165 170 175 

Thr Ser Gly Asp Ala Leu Phe Ser Asn Asn Ser Ser Ser Thr Lys Gly 
180 185 190 

Gly Ala He Ala Thr Thr Ala Gly Ala Arg He Ala Asn Asn Thr Gly 
195 200 205 

Tyr Val Arg Phe Leu Ser Asn He Ala Ser Thr Ser Gly Gly Ala He 
210 215 220 

Asp Asp Glu Gly Thr Ser He Leu Ser Asn Asn Lys Phe Leu Tyr Phe 
225 230 235 240 

Glu Gly Asn Ala Ala Lys Thr Thr Gly Gly Ala He Cys Asn Thr Lys 
245 250 255 

Ala Ser Gly Ser Pro Glu Leu He He Ser Asn Asn Lys Thr Leu He 
260 265 270 

Phe Ala Ser Asn Val Ala Glu Thr Ser Gly Gly Ala He His Ala Lys 
275 280 285 

Lys Leu Ala Leu Ser Ser Gly Gly Phe Thr Glu Phe Leu Arg Asn Asn 
290 295 300 

Val Ser Ser Ala Thr Pro Lys Gly Gly Ala He Ser He Asp Ala Ser 
305 310 315 320 

Gly Glu Leu Ser Leu Ser Ala Glu Thr Gly Asn He Thr Phe Val Arg 



325 



330 



335 
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CORRESPONDING DNA 



WO 00/32794 



FRAGMENTS AND USES THEREoH 
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DOCKET NO.: 032931-0252 



J 



Asn Thr Leu Thr Thr Thr Gly Ser Thr Asp Thr Pro Lys Arg Asn Ala 

340 345 350 

lie Asn lie Gly Ser Asn Gly Lys Phe Thr Glu Leu Arg Ala Ala Lys 

355 360 365 

Asn His Thr lie Phe Phe Tyr Asp Pro lie Thr Ser Glu Gly Thr Ser 



Tyr Gin Gly Thr lie Leu Phe Ser Gly Glu Thr Leu Thr Ala Asp Glu 
405 410 415 

Leu Lys Val Ala Asp Asn Leu Lys Ser. Ser Phe Thr Gin Pro Val Ser 
420 425 430 

Leu Ser Gly Gly Lys Leu Leu Leu Gin Lys Gly Val Thr Leu Glu Ser 
435 440 445 

Thr Ser Phe Ser Gin Glu Ala Gly Ser Leu Leu Gly Met Asp Ser Gly 
450 455 460 

Thr Thr Leu Ser Thr Thr Ala Gly Ser lie Thr He Thr Asn Leu Gly 
465 470 475 480 

He Asn Val Asp Ser Leu Gly Leu Lys Gin Pro Val Ser Leu Thr Ala 
485 490 495 

Lys Gly Ala Ser Asn Lys Val He Val Ser Gly Lys Leu Asn Leu He 
500 505 510 

Asp He Glu Gly Asn He Tyr Glu Ser His Met Phe Ser His Asp Gin 
515 520 525 

Leu Phe Ser Leu Leu Lys He Thr Val Asp Ala Asp Val Asp Thr Asn 
530 535 540 

Val Asp He Ser Ser Leu He Pro Val Pro Ala Glu Asp Pro Asn Ser 
545 550 555 560 

Glu Tyr Gly Phe Gin Gly Gin Trp Asn Val Asn Trp Thr Thr Asp Thr 
565 570 * 575 

Ala Thr Asn Thr Lys Glu Ala Thr Ala Thr Trp Thr Lys Thr Gly Phe 
580 585 590 

Val Pro Ser Pro Glu Arg Lys Ser Ala Leu Val Cys Asn Thr Leu Trp 
595 600 605 

Gly Val Phe Thr Asp He Arg Ser Leu Gin Gin Leu Val Glu He Gly 
610 615 620 

Ala Thr Gly Met Glu His Lys Gin Gly Phe Trp Val Ser Ser Met Thr 
625 630 635 640 



370 



375 



380 



Ser Asp Val Leu Lys He Asn Asn Gly Ser Ala Gly Ala Leu Asn Pro 
385 390 395 400 
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Asn Phe Leu His Lys Thr Gly Asp Glu Asn Arg Lys Gly Phe Arg His 
645 650 655 

Thr Ser Gly Gly Tyr Val lie Gly Gly Ser Ala His Thr Pro Lys Asp 
660 665 670 

Asp Leu Phe Thr Phe Ala Phe Cys His Leu Phe Ala Arg Asp Lys Asp 
675 680 685 

Cys Phe lie Ala His Asn Asn Ser Arg Thr Tyr Gly Gly Thr Leu Phe 
690 695 700 

Phe Lys His Ser His Thr Leu Gin Pro Gin Asn Tyr Leu Arg Leu Gly 
705 710 715 720 

Arg Ala Lys Phe Ser Glu Ser Ala lie Glu Lys Phe Pro Arg Glu lie 
725 730 735 

Pro Leu Ala Leu Asp Val Gin Val Ser Phe Ser His Ser Asp Asn Arg 
740 745 750 

Met Glu Thr His Tyr Thr Ser Leu Pro Glu Ser Glu Gly Ser Trp Ser 
755 760 765 

Asn Glu Cys lie Ala Gly Gly lie Gly Leu Asp Leu Pro Phe Val Leu 
770 775 780 

Ser Asn Pro His Pro Leu Phe Lys Thr Phe lie Pro Gin Met Lys Val 
785 790 795 800 

Glu Met Val Tyr Val Ser Gin Asn Ser Phe Phe Glu Ser Ser Ser Asp 
805 810 815 

Gly Arg Gly Phe Ser lie Gly Arg Leu Leu Asn Leu Ser lie Pro Val 
820 825 830 

Gly Ala Lys Phe Val Gin Gly Asp lie Gly Asp Ser Tyr Thr Tyr Asp 
835 840 845 

Leu Ser Gly Phe Phe Val Ser Asp Val Tyr Arg Asn Asn Pro Gin Ser 
850 855 860 

Thr Ala Thr Leu Val Met Ser Pro Asp Ser Trp Lys He Arg Gly Gly 
865 870 875 880 

Asn Leu Ser Arg Gin Ala Phe Leu Leu Arg Gly Ser Asn Asn Tyr Val 
885 890 895 

Tyr Asn Ser Asn Cys Glu Leu Phe Gly His Tyr Ala Met Glu Leu Arg 
900 905 910 

Gly Ser Ser Arg Asn Tyr Asn Val Asp Val Gly Thr Lys Leu Arg Phe 
915 920 925 
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<210> 13 
<211> 885 
<212> PRT 

<213> Chlamydia pneumoniae 
<400> 13 

Gly Thr Tyr Thr Pro Lys Asn Thr Thr Thr Gly lie Asp Tyr Thr Leu 
15 10 15 

Thr Gly Asp lie Thr Leu Gin Asn Leu Gly Asp Ser Ala Ala Leu Thr 
20 25 30 

Lys Gly Cys Phe Ser Asp Thr Thr Glu Ser Leu Ser Phe Ala Gly Lys 
35 40 45 

Gly Tyr Ser Leu Ser Phe Leu Asn lie. Lys Ser Ser Ala Glu Gly Ala 
50 55 60 

Ala Leu Ser Val Thr Thr Asp Lys Asn Leu Ser Leu Thr Gly Phe Ser 
65 70 75 80 

Ser Leu Thr Phe Leu Ala Ala Pro Ser Ser Val lie Thr Thr Pro Ser 
85 90 95 

Gly Lys Gly Ala Val Lys Cys Gly Gly Asp Leu Thr Phe Asp Asn Asn 
100 105 110 

Gly Thr lie Leu Phe Lys Gin Asp Tyr Cys Glu Glu Asn Gly Gly Ala 
115 120 125 

lie Ser Thr Lys Asn Leu Ser Leu Lys Asn Ser Thr Gly Ser lie Ser 
130 135 140 

Phe Glu Gly Asn. Lys Ser Ser Ala Thr Gly Lys Lys Gly Gly Ala lie 
145 150 155 160 

Cys Ala Thr Gly Thr Val Asp lie Thr Asn Asn Thr Ala Pro Thr Leu 
165 170 175 

Phe Ser Asn Asn lie Ala Glu Ala Ala Gly Gly Ala lie Asn Ser Thr 
180 185 190 

Gly Asn Cys Thr lie Thr Gly Asn Thr Ser Leu Val Phe Ser Glu Asn 
195 200 205 

Ser Val Thr Ala Thr Ala Gly Asn Gly Gly Ala Leu Ser Gly Asp Ala 
210 215 220 

Asp Val Thr lie Ser Gly Asn Gin Ser Val Thr Phe Ser Gly Asn Gin 
225 230 235 240 

Ala Val Ala Asn Gly Gly Ala lie Tyr Ala Lys Lys Leu Thr Leu Ala 
245 250 255 

Ser Gly Gly Gly Gly Gly Asn Pro Phe Ser Asn Asn He Val Gin Gly 
260 265 270 
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Thr Thr Ala Gly Asn Gly Gly Ala lie Ser lie Leu Ala Ala Gly Glu 
275 280 .„ 285 

Cys Ser Leu Phe Ser Glu Ala Gly Asp His Tyr Leu Asn Gly Asn Ala 
290 295 300 

lie Val Ala Thr Thr Pro Gin Thr Thr Lys Arg Asn Ser lie Asp lie 
305 310 315 320 

Gly Ser Thr Gly Lys Asp His Glu Leu Arg Ala lie Ser Gly His Ser 
325 330 335 

lie Phe Phe Tyr Asp Pro lie Thr Ala Asn Thr Ala Ala Asp Ser Thr 
340 345 350 

Asp Thr Leu Asn Leu Asn Lys Ala Asp .Ala Gly Asn Ser Thr Asp Tyr 
355 360 365 

Ser Gly Ser lie Val Phe Ser Gly Glu Lys Leu Ser Glu Asp Glu Ala 
370 375 380 

Lys Val Ala Asp Asn Leu Thr Ser Thr Leu Lys Gin Pro Val Thr Leu 
385 390 395 400 

Thr Ala Gly Asn Leu Val Leu Lys Arg Gly Val Thr Leu Asp Thr Lys 
405 410 415 

Gly Phe Thr Gin Thr Ala Gly Ser Ser Val He Met Asp Ala Gly Thr 
420 425 430 

Thr Leu Lys Ala Ser Thr Glu Glu Val Thr Leu Thr Gly Leu Ser He 
435 440 445 

Pro Val Asp Ser Leu Gly Glu Gly Lys Lys Val Val He Ala Ala Ser 
450 455 460 



Ala Ala Ser Lys Asn Val Ala Leu Ser Gly Pro He Leu Leu Leu Asp 
465 470 475 480 

Asn Gin Gly Asn Ala Tyr Glu Asn His Asp Leu Gly Lys Thr Gin Asp 
485 490 495 

Phe Ser Phe Val Gin Leu Ser Ala Leu Gly Thr Ala Thr Thr Thr Asp 
500 505 510 

Val Pro Ala Val Pro Thr Val Ala Thr Pro Thr His Tyr Gly Tyr Gin 
515 520 525 

Gly Thr Trp Gly Met Thr Trp Val Asp Asp Thr Ala Ser Thr Pro Lys 
530 535 540 

Thr Lys Thr Ala Thr Leu Ala Trp Thr Asn Thr Gly Tyr Leu Pro Asn 
545 550 555 560 

Pro Glu Arg Gin Gly Pro Leu Val Pro Asn Ser Leu Trp Gly Ser Phe 
565 570 575 
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Ser Asp lie Gin Ala He Gin Gly Val He Glu Arg Ser Ala Leu Thr 
580 585 590 

Leu Cys Ser Asp Arg Gly Phe Trp Ala Ala Gly Val Ala Asn Phe Leu 
595 600 605 

Asp Lys Asp Lys Lys Gly Glu Lys Arg Lys Tyr Arg His Lys Ser Gly 
610 615 620 

Gly Tyr Ala He Gly Gly Ala Ala Gin Thr Cys Ser Glu Asn Leu He 
625 630 635 640 

Ser Phe Ala Phe Cys Gin Leu Phe Gly Ser Asp Lys Asp Phe Leu Val 
645 650 655 

Ala Lys Asn His Thr Asp Thr Tyr Ala Gly Ala Phe Tyr He Gin His 
660 665 670 

He Thr Glu, Cys Ser Gly Phe He Gly Cys Leu Leu Asp Lys Leu Pro 
675 680 685 

Gly Ser Trp Ser His Lys Pro Leu Val Leu Glu Gly Gin Leu Ala Tyr 
690 695 700 

Ser His Val Ser Asn Asp Leu Lys Thr Lys Tyr Thr Ala Tyr Pro Glu 
705 710 715 720 

Val Lys Gly Ser Trp Gly Asn Asn Ala Phe Asn Met Met Leu Gly Ala 
725 730 735 

Ser Ser His Ser Tyr Pro Glu Tyr Leu His Cys Phe Asp Thr Tyr Ala 
740 745 750 

Pro Tyr He Lys Leu Asn Leu Thr Tyr He Arg Gin Asp Ser Phe Ser 
755 760 765 

Glu Lys Gly Thr Glu Gly Arg Ser Phe Asp Asp Ser Asn Leu Phe Asn 
770 775 780 

Leu Ser Leu Pro He Gly Val Lys Phe Glu Lys Phe Ser Asp Cys Asn 
785 790 795 800 

Asp Phe Ser Tyr Asp Leu Thr Leu Ser Tyr Val Pro Asp Leu He Arg 
805 810 815 

Asn Asp Pro Lys Cys Thr Thr Ala Leu Val He Ser Gly Ala Ser Trp 
820 825 830 

Glu Thr Tyr Ala Asn Asn Leu Ala Arg Gin Ala Leu Gin Val Arg Ala 
835 840 845 

Gly Ser His Tyr Ala Phe Ser Pro Met Phe Glu Val Leu Gly Gin Phe 
850 855 860 



Val Phe Glu Val Arg Gly Ser Ser Arg He Tyr Asn Val Asp Leu Gly 
865 870 875 880 
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Gly Lys Phe Gin Phe 
885 



<210> 14 
<211> 928 
<212> PRT 

<213> Chlamydia pneumoniae 
<400> 14 

Met Lys Ser Ser Leu His Trp Phe Leu lie Ser Ser Ser Leu Ala Leu 
1 5 10 15 

Pro Leu Ser Leu Asn Phe Ser Ala Phe Ala Ala Val Val Glu lie Asn 

20 25 30 

Leu Gly Pro Thr Asn Ser Phe Ser Gly Pro Gly Thr Tyr Thr Pro Pro 

35 40 45 

Ala Gin Thr Thr Asn Ala Asp Gly Thr lie Tyr Asn Leu Thr Gly Asp 
50 55 60 

Val Ser lie Thr Asn Ala Gly Ser Pro Thr Ala Leu Thr Ala Ser Cys 
65 70 75 80 

Phe Lys Glu Thr Thr Gly Asn Leu Ser Phe Gin Gly His Gly Tyr Gin 
85 90 95 

Phe Leu Leu Gin Asn lie Asp Ala Gly Ala Asn Cys Thr Phe Thr Asn 
100 105 110 

Thr Ala Ala Asn Lys Leu Leu Ser Phe Ser Gly Phe Ser Tyr Leu Ser 
115 120 125 

Leu lie Gin Thr Thr Asn Ala Thr Thr Gly Thr Gly Ala lie Lys Ser 
130 135 140 

Thr Gly Ala Cys Ser lie Gin Ser Asn Tyr Ser Cys Tyr Phe Gly Gin 
145 150 155 160 

Asn Phe Ser Asn Asp Asn Gly Gly Ala Leu Gin Gly Ser Ser lie Ser 
165 170 175 

Leu Ser Leu Asn Pro Asn Leu Thr Phe Ala Lys Asn Lys Ala Thr Gin 
180 185 190 

Lys Gly Gly Ala Leu Tyr Ser Thr Gly Gly lie Thr lie Asn Asn Thr 
195 200 205 

Leu Asn Ser Ala Ser Phe Ser Glu Asn Thr Ala Ala Asn Asn Gly Gly 
210 215 220 

Ala lie Tyr Thr Glu Ala Ser Ser Phe lie Ser Ser Asn Lys Ala lie 
225 230 235 240 
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Ser Phe lie Asn Asn Ser Val Thr Ala Thr Ser Ala Thr Gly Gly Ala 
245 250 255 

lie Tyr Cys Ser Ser Thr Ser Ala Pro Lys Pro Val Leu Thr Leu Ser 
260 265 270 

Asp Asn Gly Glu Leu Asn Phe lie Gly Asn Thr Ala lie Thr Ser Gly 
275 280 285 

Gly Ala lie Tyr Thr Asp Asn Leu Val Leu Ser Ser Gly Gly Pro Thr 
290 295 300 

Leu Phe Lys Asn Asn Ser Gly Tyr Asp Thr Ala Ala Pro Leu Gly Gly 
305 310 315 320 

Ala lie Ala lie Ala Asp Ser Gly Ser. Leu Ser Leu Ser Ala Leu Gly 
325 330 335 

Gly Asp lie Thr Phe Glu Gly Asn Thr Val Val Lys Gly Ala Ser Ser 
340 345 350 

Ser Gin Thr Thr Thr Arg Asn Ser lie Asn lie Gly Asn Thr Asn Ala 
355 360 365 

Lys He Val Gin Leu Arg Ala Ser Gin Gly Asn Thr He Tyr Phe Tyr 
370 375 380 

Asp Pro He Thr Thr Ser He Thr Ala Ala Leu Ser Asp Ala Leu Asn 
385 390 ' 395 400 

Leu Asn Gly Pro Asp Leu Ala Gly Asn Pro Ala Tyr Gin Gly Thr He 
405 410 415 

Val Phe Ser Gly Glu Lys Leu Ser Glu Ala Glu Ala Ala Glu Ala Asp 
420 425 430 

Asn Leu Lys Ser Thr He Gin Gin Pro Leu Thr Leu Ala Gly Gly Gin 
435 440 445 

Leu Ser Leu Lys Ser Gly Val Thr Leu Val Ala Lys Ser Phe Ser Gin 
450 455 460 

Ser Pro Gly Ser Thr Leu Leu Met Asp Ala Gly Thr Thr Leu Glu Thr 
465 470 475 480 

Ala Asp Gly He Thr lie Asn Asn Leu Val Leu Asn Val Asp Ser Leu 
485 490 495 

Lys Glu Thr Lys Lys Gly Thr Leu Lys Ala Thr Gin Ala Ser Gin Thr 
500 505 510 

Val Thr Leu Ser Gly Ser Leu Ser Leu Val Asp Pro Ser Gly Asn Val 
515 520 525 



Tyr Glu Asp Val Ser Trp Asn Asn Pro Gin Val Phe Ser Cys Leu Thr 
530 535 540 
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Leu Thr Ala Asp Asp Pro Ala Asn lie His He Thr Asp Leu Ala Ala 
545 550 555 560 

Asp Pro Leu Glu Lys Asn Pro He His Trp Gly Tyr Gin Gly Asn Trp 
565 570 575 

Ala Leu Ser Trp Gin Glu Asp Thr Ala Thr Lys Ser Lys Ala Ala Thr 
580 585 590 

Leu Thr Trp Thr Lys Thr Gly Tyr Asn Pro Asn Pro Glu Arg Arg Gly 
595 600 605 

Thr Leu Val Ala Asn Thr Leu Trp Gly Ser Phe Val Asp Val Arg Ser 
610 615 620 

lie Gin Gin Leu Val Ala Thr Lys Val Arg Gin Ser Gin Glu Thr Arg 
625 630 635 640 

Gly He Trp Cys Glu Gly He Ser Asn Phe Phe His Lys Asp Ser Thr 
645 650 655 

Lys He Asn Lys Gly Phe Arg His He Ser Ala Gly Tyr Val Val Gly 
660 665 670 

Ala Thr Thr Thr Leu Ala Ser Asp Asn Leu He Thr Ala Ala Phe Cys 
675 680 685 

Gin Leu Phe Gly Lys Asp Arg Asp His Phe He Asn Lys Asn Arg Ala 
690 695 700 

Ser Ala Tyr Ala Ala Ser Leu His Leu Gin His Leu Ala Thr Leu Ser 
705 710 715 720 

Ser Pro Ser Leu Leu Arg Tyr Leu Pro Gly Ser Glu Ser Glu Gin Pro 
725 730 735 

Val Leu Phe Asp Ala Gin He Ser Tyr He Tyr Ser Lys Asn Thr Met 
740 745 750 

Lys Thr Tyr Tyr Thr Gin Ala Pro Lys Gly Glu Ser Ser Trp Tyr Asn 
755 760 765 

Asp Gly Cys Ala Leu Glu Leu Ala Ser Ser Leu Pro His Thr Ala Leu 
770 775 780 

Ser His Glu Gly Leu Phe His Ala Tyr Phe Pro Phe He Lys Val Glu 
785 790 795 800 

Ala Ser Tyr He His Gin Asp Ser Phe Lys Glu Arg Asn Thr Thr Leu 
805 810 815 

Val Arg Ser Phe Asp Ser Gly Asp Leu He Asn Val Ser Val Pro He 
820 825 830 

Gly He Thr Phe Glu Arg Phe Ser Arg Asn Glu Arg Ala Ser Tyr Glu 
835 840 845 
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Ala Thr Val lie Tyr Val Ala Asp Val Tyr Arg Lys Asn Pro Asp Cys 
850 855 860 

Thr Thr Ala Leu Leu lie Asn Asn Thr Ser Trp Lys Thr Thr Gly Thr 
865 870 875 880 

Asn Leu Ser Arg Gin Ala Gly He Gly Arg Ala Gly He Phe Tyr Ala 
885 890 895 

Phe Ser Pro Asn Leu Glu Val Thr Ser Asn Leu Ser Met Glu He Arg 
900 905 910 

Gly Ser Ser Arg Ser Tyr Asn Ala Asp Leu Gly Gly Lys Phe Gin Phe 
915 920 925 



<210> 15 
<211> 930 
<212> PRT 

<213> Chlamydia pneumoniae 
<400> 15 

Met Lys He Pro Leu His Lys Leu Leu He Ser Ser Thr Leu Val Thr 
1 5 10 15 

Pro He Leu Leu Ser He Ala Thr Tyr Gly Ala Asp Ala Ser Leu Ser 
20 25 30 

Pro Thr Asp Ser Phe Asp Gly Ala Gly Gly Ser Thr Phe Thr Pro Lys 
35 40 45 

Ser Thr Ala Asp Ala Asn Gly Thr Asn Tyr Val Leu Ser Gly Asn Val 
50 55 60 

Tyr He Asn Asp Ala Gly Lys Gly Thr Ala Leu Thr Gly Cys Cys Phe 
65 70 75 80 

Thr Glu Thr Thr Gly Asp Leu Thr Phe Thr Gly Lys Gly Tyr Ser Phe 
85 90 95 

Ser Phe Asn Thr Val Asp Ala Gly Ser Asn Ala Gly Ala Ala Ala Ser 
100 105 HO 

Thr Thr Ala Asp Lys Ala Leu He Phe Thr Gly Phe Ser Asn Leu Ser 
115 120 125 

Phe lie Ala Ala Pro Gly Thr Thr Val Ala Ser Gly Lys Ser Thr Leu 
130 135 140 

Ser Ser Ala Gly Ala Leu Asn Leu Thr Asp Asn Gly Thr He Leu Phe 
145 150 155 160 

Ser Gin Asn Val Ser Asn Glu Ala Asn Asn Asn Gly Gly Ala He Thr 
165 170 175 
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Thr Lys Thr Leu Ser lie Ser Gly Asn Thr Ser Ser He Thr Phe Thr 
180 185 190. 

Ser Asn Ser Ala Lys Lys Leu Gly Gly Ala lie Tyr Ser Ser Ala Ala 
195 200 205 

Ala Ser He Ser Gly Asn Thr Gly. Gin Leu Val Phe Met Asn Asn Lys 
210 215 220 

Gly Glu Thr Gly Gly Gly Ala Leu Gly Phe Glu Ala Ser Ser Ser He 
225 230 235 240 

Thr Gin Asn Ser Ser Leu Phe Phe Ser Gly Asn Thr Ala Thr Asp Ala 
245 250 255 

Ala Gly Lys Gly Gly Ala He Tyr Cys Glu Lys Thr Gly Glu Thr Pro 
260 265 270 

Thr Leu Thr He Ser Gly Asn Lys Ser Leu Thr Phe Ala Glu Asn Ser 
275 280 285 

Ser Val Thr Gin Gly Gly Ala He Cys Ala His Gly Leu Asp Leu Ser 
290 295 300 

Ala Ala Gly Pro Thr Leu Phe Ser Asn Asn Arg Cys Gly Asn Thr Ala 
305 310 315 320 

Ala Gly Lys Gly Gly Ala lie Ala He Ala Asp Ser Gly Ser Leu Ser 
325 330 335 

Leu Ser Ala Asn Gin Gly Asp He Thr Phe Leu Gly Asn Thr Leu Thr 
340 345 350 

Ser Thr Ser Ala Pro Thr Ser Thr Arg Asn Ala He Tyr Leu Gly Ser 
355 360 365 

Ser Ala Lys He Thr Asn Leu Arg Ala Ala Gin Gly Gin Ser lie Tyr 
370 375 380 

Phe Tyr Asp Pro He Ala Ser Asn Thr Thr Gly Ala Ser Asp Val Leu 
385 390 395 400 

Thr lie Asn Gin Pro Asp Ser Asn Ser Pro Leu Asp Tyr Ser Gly Thr 
405 410 415 

lie Val Phe Ser Gly Glu Lys Leu Ser Ala Asp Glu Ala Lys Ala Ala 
420 425 430 

Asp Asn Phe Thr Ser lie Leu Lys Gin Pro Leu Ala Leu Ala Ser Gly 
435 440 445 

Thr Leu Ala Leu Lys Gly Asn Val Glu Leu Asp Val Asn Gly Phe Thr 
450 455 460 

Gin Thr Glu Gly Ser Thr Leu Leu Met Gin Pro Gly Thr Lys Leu Lys 
465 470 475 480 
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Ala Asp Thr Glu Ala lie Ser Leu Thr Lys Leu Val Val Asp Leu Ser 
485 490 * 495 

Ala Leu Glu Gly Asn Lys Ser Val Ser lie Glu Thr Ala Gly Ala Asn 
500 505 510 

Lys Thr lie Thr Leu Thr Ser Pro Leu Val Phe Gin Asp Ser Ser Gly 
515 520 525 

Asn Phe Tyr Glu Ser His Thr lie Asn Gin Ala Phe Thr Gin Pro Leu 
530 535 540 

Val Val Phe Thr Ala Ala Thr Ala Ala Ser Asp lie Tyr He Asp Ala 
545 550 555 560 

Leu Leu Thr Ser Pro Val Gin Thr Pro Glu Pro His Tyr Gly Tyr Gin 
565 570 575 

Gly His Trp Glu Ala Thr Trp Ala Asp Thr Ser Thr Ala Lys Ser Gly 
580 585 590 

Thr Met Thr Trp Val Thr Thr Gly Tyr Asn Pro Asn Pro Glu Arg Arg 
595 600 605 

Ala Ser Val Val Pro Asp Ser Leu Trp Ala Ser Phe Thr Asp He Arg 
610 615 620 

Thr Leu Gin Gin He Met Thr Ser Gin Ala Asn Ser He Tyr Gin Gin 
625 630 635 640 

Arg Gly Leu Trp Ala Ser Gly Thr Ala Asn Phe Phe His Lys Asp Lys 
645 650 655 

Ser Gly Thr Asn Gin Ala Phe Arg His Lys Ser Tyr Gly Tyr He Val 
660 665 670 

Gly Gly Ser Ala Glu Asp Phe Ser Glu Asn He Phe Ser Val Ala Phe 
675 680 685 

Cys Gin Leu Phe Gly Lys Asp Lys Asp Leu Phe He Val Glu Asn Thr 
690 695 700 

Ser His Asn Tyr Leu Ala Ser Leu Tyr Leu Gin His Arg Ala Phe Leu 
705 7.10 715 720 

Gly Gly Leu Pro Met Pro Ser Phe Gly Ser He Thr Asp Met Leu Lys 
725 730 . 735 

Asp He Pro Leu He Leu Asn Ala Gin Leu Ser Tyr Ser Tyr Thr Lys 
740 745 750 

Asn Asp Met Asp Thr Arg Tyr Thr Ser Tyr Pro Glu Ala Gin Gly Ser 
755 760 765 

Trp Thr Asn Asn Ser Gly Ala Leu Glu Leu Gly Gly Ser Leu Ala Leu 
770 775 780 
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Tyr Leu Pro Lys Glu Ala Pro Phe Phe Gin Gly Tyr Phe Pro Phe Leu 
785 790 795 800 

Lys Phe Gin Ala Val Tyr Ser Arg Gin Gin Asn Phe Lys Glu Ser Gly 
805 810 815 

Ala Glu Ala Arg Ala Phe Asp Asp Gly Asp Leu Val Asn Cys Ser lie 
820 825 830 

Pro Val Gly lie Arg Leu Glu Lys lie Ser Glu Asp Glu Lys Asn Asn 
835 840 845 

Phe Glu lie Ser Leu Ala Tyr lie Gly Asp Val Tyr Arg Lys Asn Pro 
850 855 860 

Arg Ser Arg Thr Ser Leu Met Val Ser Gly Ala Ser Trp Thr Ser Leu 
865 870 875 880 

Cys Lys Asn Leu Ala Arg Gin Ala Phe Leu Ala Ser Ala Gly Ser His 
885 890 895 

Leu Thr Leu Ser Pro His Val Glu Leu Ser Gly Glu Ala Ala Tyr Glu 
900 905 910 

Leu Arg Gly Ser Ala His He Tyr Asn Val Asp Cys Gly Leu Arg Tyr 
915 920 925 

Ser Phe 
930 



<210> 16 
<211> 293 
<212> PRT 

<213> Chlamydia pneumoniae 
<400> 16 

Met Leu Ser Ser Leu He Arg Asp Ser Phe Pro Leu Leu He Leu Leu 
15 10 15 

Pro Thr Phe Leu Ala Ala Leu Gly Ala Ser Val Ala Gly Gly Val Met 
20 25 30 

Gly Thr Tyr lie Val Val Lys Arg He Val Ser He Ser Gly Ser He 
35 40 45 

Ser His Ala He Leu Gly Gly He Gly Leu Thr Leu Trp He Gin Tyr 
50 55 60 

Lys Leu His Leu Ser Phe Phe Pro Met Tyr Gly Ala He Val Gly Ala 
65 70 75 80 

He Phe Leu Ala Leu Cys He Gly Lys He His Leu Lys Tyr Gin Glu 



85 



90 



95 
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Arg Glu Asp Ser Leu lie Ala Met lie Trp Ser Val Gly Met Ala He 
100 105 110 

Gly lie He Phe He Ser Arg Leu Pro Thr Phe Asn Gly Glu Leu He 
115 120 125 

Asn Phe Leu Phe Gly Asn He Leu Trp Val Thr Pro Ser Asp Leu Tyr 
130 135 140 

Ser Leu Gly He Phe Asp Leu Leu Val Leu Gly He Val Val Leu Cys 
145 150 155 160 

His Thr Arg Phe Leu Ala Leu Cys Phe Asp Glu Arg Tyr Thr Ala Leu 
165 170 175 

Asn His Cys Ser Val Gin Leu Trp Tyr Phe Leu Leu Leu Val Leu Thr 
180 185 190 

Ala He Thr He Val Met Leu lie Tyr Val Met Gly Thr He Leu Met 
195 200 205 

Leu Ser Met Leu Val Leu Pro Val Ala He Ala Cys Arg Phe Ser Tyr 
210 215 220 

Lys Met Thr Arg He Met Phe He Ser Val Leu Leu Asn He Leu Cys 
225 230 235 240 

Ser Phe Ser Gly He Cys He Ala Tyr Cys Leu Asp Phe Pro Val Gly 
245 250 255 

Pro Thr He Ser Leu Leu Met Gly Leu Gly Tyr Thr Ala Ser Leu Cys 
260 265 270 

Val Lys Lys Arg Tyr Asn Pro Ser Thr Pro Ser Pro Val Ser Pro Glu 
275 280 285 

He Asn Thr Asn Val 
290 
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